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Abstract 

The properties of vehicle time headways are fundamental in many traffic 
engineering applications, such as capacity and level of service studies on 
highways, unsignalized intersections, and roundabouts. The operation of 
modern vehicle-actuated traffic signals is based on the measurement of time 
headways in the arriving traffic fiow. In addition, the vehicle generation 
in traffic fiow simulation is usually based on some theoretical vehicle time 
headway model. 

The statistical analysis of vehicle time headways has been inadequate 
in three important aspects: 1) There has been no standard procedure to 
collect headway data and to describe their statistical properties. 2) The 
goodness-of-fit tests have been either powerless or infeasible. 3) Test results 
from multi-sample data have not been combined properly. 

A four-stage identification process is suggested to describe the headway 
data and to compare it with theoretical distributions. The process includes 
the estimation of the probability density function, the hazard function, the 
coefficient of variation, and the squared skewness and the kurtosis. The 
four-stage identification process effectively describes those properties of the 
distribution that are most helpful in selecting a theoretical headway model. 

One of the major problems in the headway studies has been the method 
for goodness-of-fit tests. The two most commonly used tests are the chi- 
square test and the nonparametric Kolmogorov-Smirnov test. The chi- 
square test is not very powerful. The nonparametric Kolmogorov-Smirnov 
test should be applied only, when the parameters of the distribution are 
known. If the parameters are estimated from the data, as in typical headway 
studies, the nonparametric Kolmogorov-Smirnov test gives too conservative 
results. These problems are addressed by parametric goodness-of-fit tests 
based on Monte Carlo methods. 

Another great problem has been the lack of theoretical foundation in 
dealing with multi-sample data. The headway data usually consist of several 
samples, and the null hypothesis is tested against each of them. Two meth¬ 
ods are presented to strengthen the evidence of multi-sample tests: 1) The 
combined probability method gives a single significance probability based 



on several independent tests. 2) The moving probability method is used to 
describe the variation of combined probabilities against traffic volume. 

These methods were applied to time headway data from Finnish two- 
lane two-way roads. The independence of consecutive headways was tested 
using autocorrelation analysis, runs tests and goodness-of-fit tests for the 
geometric bunch size distribution. The results indicate that the renewal 
hypothesis should not be accepted in all traffic situations. This conclusion 
is partly supported by a further analysis of previous studies. 

Five theoretical distributions were tested for goodness of fit: the negative 
exponential distribution, the shifted exponential distribution, the gamma 
distribution, the lognormal distribution and the semi-Poisson distribution. 
None of these passed the tests. In the parameter estimation the maximum 
likelihood method was preferred. For distributions having a location (thresh¬ 
old) parameter, a modified maximum likelihood method was shown to give 
good estimates. 

The proposed procedures give a scientific foundation to identify and esti¬ 
mate statistical models for vehicle time headways, and to test the goodness 
of fit. It is shown that the statistical methods in the analysis of vehicle 
headways should be thoroughly revised following the guidelines presented 
here. 
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Best linear unbiased estimator 
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Sample coefficient of variation 
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Preface 


The work started with the aim of finding more effective methods for vehicle- 
actuated traffic signal control on two-lane roads. Part of the project was 
the development of a traffic signal control simulation program. In order to 
find a suitable headway variate generator, some field measurements were 
performed on two-lane roads with speed limits not greater than 70 km/h. 
(In Finland traffic signal control is not allowed on roads with higher speed 
limits.) The location and time of each measurement was selected so that 
the expected flow rate would not be much above 1,000 veh/h. 

The analysis of the vehicle time headway data soon appeared to be a 
far more complex issue than was initially assumed. It became evident that 
the methods commonly used in the literature were not very efficient, and 
in many cases they were infeasible. Consequently, the focus of the study 
shifted from traffic signal control to the statistical analysis of vehicle head¬ 
ways. Some shortcomings in this report are due to this fact. When the 
field measurements were made, the extent and type of the analyses to be 
performed was not anticipated. 

Many of the theoretical distributions and their basic properties have been 
described in an introductory paper (Luttinen 1990). The major results have 
been published earlier: the goodness-of-fit tests and parameter estimation 
for the gamma distribution (Luttinen 1991); the basic statistical properties 
of time headways (Luttinen 1992); and the methods of identification, es¬ 
timation, and goodness-of-fit tests (Luttinen 1994). The comments of the 
reviewers of these papers have been very helpful. 

The traffic analyzer measurements were performed by the Laboratory of 
Transportation Engineering at the Helsinki University of Technology. My 
special thanks to Mr. Kari Hintikka for all the care he has taken with the 
measurements. I gratefully acknowledge professors Arde Faghri, Sulevi Lyly, 
Hannu Nierni and Matti Pursula for their advice and support. Professor 
Faghri has given very valuable help to make the purpose and style of the 
text more clear. Professor Lyly has helped me in many practical matters. 
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Finally, my deepest thanks to my wife Tarja, to my children Mikko, 
Hanna, and Jaakko, and to my parents Raili and Matti for their continuous 
support during the past years. Without their patience and encouragement 
this report would never have seen the daylight. 

Lahti, Finland 

April 28, 1996 

R. Tapio Luttinen 



Chapter 1 

Introduction 


1.1 Definitions 

Time headway , or headway for short, is the time interval between two vehi¬ 
cles passing a point as measured from the front bumper to the front bumper. 
A headway is accordingly the sum of the time used by a vehicle to pass the 
observation point (occupancy time ) and the time interval (gap) to the arrival 
of the next vehicle. 

In platoons the gap is usually more meaningful to the driver than the 
headway. Headways are, however, more important in the traffic fiow theory. 
Because the flow rate 1 is reciprocal to the mean headway, the study of 
headways is closely connected to the study of highway capacity. In selecting 
a headway a driver establishes a flow rate for the vehicle (Dawson & Chimini 
1968). 

The headways are measured in time rather than in space. Time head¬ 
ways, rather than space headways or spacings between consecutive vehicles, 
have been selected as the subject because: 

1. Time headways and flow rates are directly connected. There is a simi¬ 
lar connection between spacings and densities, but flow rates are usu¬ 
ally more meaningful and more easily measured by practicing traffic 
engineers. 

2. In platoons the driver of a trailing vehicle adjusts the gap considering 
safety, one aspect of which is the reaction time. It can be assumed 

1 The flow rate or traffic flow means the expected number of vehicles per time unit 
passing the reference point. The traffic volume is the measured number of vehicles per 
time unit. In HCM (1994) the terms have been used in a different sense. 
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CHAPTER 1. INTRODUCTION 


that the gap, and hence the headway of a trailing vehicle, is not as 
sensitive to vehicle speeds as the spacing. 

3. An estimate of the spacing can be obtained as the product of speed 
and headway, assuming constant speed during the headway. 

In the literature on point processes the counting specification is often 
taken as primary, because it allows a generalization to higher dimensional 
spaces (e.g., Daley & Vere-Jones 1988). This study, however, discusses the 
traffic flow as a process on the real line. For this purpose the interval 
specification is more informative. 

1.2 Importance of headway studies 

Many traffic operations (such as passing, merging, and crossing) depend 
on the availability of large headways in the traffic flow. Consequently, the 
distribution of headways has an effect on platoon formation and delays. 

In the 1985 Highway Capacity Manual (HCM 1985) the level of service 
on two-lane rural highways is approximated by the proportion of headways 
less than five seconds—thus making a connection between the headway dis¬ 
tribution and the level of service. In addition, the headways play a crucial 
role in the analysis of unsignalized intersections and roundabouts (Sullivan 
& Troutbeck 1994). 

Although the average delay in fixed time traffic signals is not very sensi¬ 
tive to the detailed stochastic properties of the arrival model (Newell 1956, 
Newell 1965, Kingman 1962), in vehicle actuated traffic signals the con¬ 
trol, the extension time in particular, is very sensitive to the arrival pattern 
(Darroch, Newell & Morris 1964). The characteristics of vehicle headways 
are accordingly quite relevant in the study of optimal traffic signal control. 

The mathematical analysis and simulation of traffic operations are based 
on theoretical models, which must be evaluated against the properties of real 
world data. More advanced methods have increased the need for reliable 
knowledge of the statistical properties of vehicle headways. 

1.3 History and problems 

Traffic ffow has been considered as a stochastic process at least since Adams 
(1936). He formulated the idea of arrivals as a random (i.e., Poisson) process 
and verified good agreement with theory and observations. Since then many 
more sophisticated models have been proposed. 
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The properties of headways have been extensively studied, especially in 
the 60’s. Some of this earlier work is now drawn together and evaluated in 
the light of recent data and more powerful statistical methods. 

No unified framework has been developed for traffic flow theory. Accord¬ 
ingly, the vehicle headway studies have been concentrated on the statistical 
analysis of headway data. There have been, however, some notable excep¬ 
tions. Greenberg (1966) has found a connection between the microscopic 
traffic flow theory and the lognormal follower headway distribution. 

An interesting new approach has been developed by Heidemann (1990, 
1993). He has applied the theory of stochastic processes to analytically 
derive the headway distribution as a function of traffic density. Heidemann 
criticizes the statistical approach for lacking in detailed analytical reasons 
that would justify the various types of distributions. His approach may 
provide a link between the macroscopic traffic flow theory and the headway 
distributions, but so far it has not found many applications. The distribution 
function is rather complicated, and its statistical properties have not been 
widely explored. The Heidemann model is not discussed below, but it well 
deserves further studies. 

The statistical analysis of vehicle headways has been inadequate in three 
important aspects: 

1. There has been no standard procedures to collect headway data and 
to describe their statistical properties. 

2. The goodness-of-fit tests have been either powerless or infeasible. 

3. Test results from several samples have not been combined properly. 

In the literature on vehicle headways the shape of the empirical (sam¬ 
ple) distributions have been mostly described by cumulative distribution 
functions and histograms. In some studies measures such as the mode 
(May 1961, Summala & Vierimaa 1980, Rajalin & Hassel 1992), the median 
(May 1961) and the coefficient of variation (May 1965, Buckley 1968, Dunne, 
Rothery & Potts 1968, Breinran, Lawrence, Goodwin & Bailey 1977, Pur- 
sula & Sainio 1985, Griffiths & Hunt 1991) have been presented. The re¬ 
newal hypothesis (mutually independent and identically distributed head¬ 
ways) has also drawn the attention of several researchers (e.g., Dunne et al. 
1968, Breinran, Gafarian, Lichtenstein & Murthy 1969, Cowan 1975, Chris- 
sikopoulos, Darzentas & McDowell 1982). 
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The work of Breiman et al. (1969) deserves special attention. They have 
made a serious attempt to standardize the analysis of headway data by 
presenting a method to determine trendless time periods in the traffic flow, 
and by introducing a set of three tests for the renewal hypothesis. The idea 
of trendless data has not, however, been unanimously accepted. In addition, 
the comparison of the headway data presented by various authors is difficult, 
because no standard set of analyses has been applied to identify the most 
important properties of the headway data. 

The goodness of fit has been tested mainly by three methods: (1) graph¬ 
ical evaluation, (2) the chi-square test, and (3) the Kolmogorov-Smirnov 
test. The graphical evaluation has been based on the probability distribu¬ 
tion function (e.g., Akgelik & Chung 1994) or the probability density func¬ 
tion (e.g., Buckley 1968). These methods are highly subjective and should 
be used only in preliminary studies. The use of histograms to estimate the 
probability density function has sometimes even produced a distorted idea 
of the distribution. 2 

The chi-square test is theoretically justified, but it is based on the re¬ 
duction of observations into counts, which decreases the power of the test 
considerably. The Kolmogorov-Smirnov test has been even more problem¬ 
atic. It has been used in its nonparametric form, which is based on the 
assumption that the parameters of the distribution are known. In the head¬ 
way studies the parameters are, however, usually estimated from the data. 
Because the estimation makes the fit better, the critical values should be 
smaller. This correction has not been made in any of the studies reviewed. 
Consequently, the results have been too conservative. The problems asso¬ 
ciated with the goodness-of-fit tests have not received proper attention in 
previous studies. 

Another great problem has been the lack of theoretical foundation in 
dealing with multi-sample data. The headway data usually consist of several 
samples, and the null hypothesis is tested against each sample. The result 
has often been presented as the number or proportion of tests which have 
led to the rejection of the null hypothesis (e.g., Buckley 1968, Breiman et al. 
1969). In some cases even the average significance probability has been used 
(e.g., Branston 1976). In none of the studies reviewed have the individual 
tests been properly combined to give a scientifically valid evaluation of the 
overall acceptability of the hypothesis. 


2 The figure on page 459 of Griffiths & Hunt (1991) gives the impression of a bell shaped 
probability density function that is not supported by their equation 1. 
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1.4 Purpose of the study 

The scope of this study is the statistical analysis of vehicle headways on two- 
lane two-way roads. The headways on both lanes are analyzed separately. 
The study has two purposes: 

1. To introduce a statistical methodology for the analysis of the headway 
data and for the evaluation of the proposed models 

2. To get information about the statistical properties of headways on two- 
lane two-way roads. 

The main objective is to present proper statistical methods for the goodness- 
of-fit tests and for the combination of significance probabilities. The second 
objective is to present a set of methods to effectively describe the statistical 
properties of headway data. These tests and methods will be applied to 
some of the most commonly used headway distribution models. Combined 
with the discussion on the data collection and parameter estimation, the 
text presents a scientific foundation for the statistical analysis of vehicle 
headways. 

Although the data are from two-lane two-way roads, the statistical meth¬ 
ods presented can be applied to any headway data. Some methods have even 
more general interest. 

1.5 Summary of contents 

The presentation follows the statistical modeling process in figure 1.1. Chap¬ 
ter 2 reviews the microscopic traffic ffow theory in the light of vehicle head¬ 
ways. There is also a short introduction to the main statistical methods 
used. The problem of combining the test results from several samples is 
solved by the method of combined probabilities (Fisher 1938). The moving 
probability method developed by Luttinen (1992) allows the analysis of test 
results over another variable, such as traffic volume. 

Chapter 3 discusses the measurements and the preliminary analysis of 
the Finnish headway data. The methods introduced in earlier literature to 
obtain stationary headway data are evaluated. Because the combination of 
samples introduces a bias in the data, data collection based on trend analysis 
gives better results. 

Chapter 4 introduces methods for model identification. In addition to 
the conventional probability distribution function and the probability den¬ 
sity function, the shape of the headway distribution is analyzed using the 
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Figure 1.1: Statistical modeling process 


hazard function, which has been extensively applied in life and reliability 
data analyses. The first four moments are shown to be very informative 
measures of location and dispersion. On the basis of these analyses a four- 
stage identification process (Luttinen 1994) is introduced. In addition, three 
methods are used to test, whether the renewal hypothesis can be accepted 
for the headway data. 

Chapter 5 analyses the applicability of several statistical distributions as 
models for the headway distributions on two-lane two-way roads. Main em¬ 
phasis is on the goodness-of-fit tests and the parameter 3 * * & estimation. Para¬ 
metric goodness-of-fit tests are compared with the nonparanretric Kolnro- 
gorov-Smirnov tests. 

It is assumed that the reader has a basic knowledge of the probability 
theory as well as statistical and numerical methods. The theory is described 
only to the extent that is essential to understand the text. More detailed 
descriptions can be found in the references given. 


3 The parameters are called location, scale and shape parameters following the terminol¬ 
ogy of D’Agostino & Stephens (1986), Johnson, Kotz & Balakrishnan (1994), and Stuart 
& Ord (1987). A parameter r is called the location parameter, if the distribution has the 


form f(t — r), and 9 is called the scale parameter , if the form is 9f(9t) or —f 

0 


(Stuart 


& Ord 1987). If higher moments (skewness and kurtosis) of a distribution are defined by 
a third parameter, it is called the shape parameter. 































Chapter 2 


Theoretical background 


2.1 Microscopic traffic flow theory 

The microscopic theory of traffic flow describes the movements of vehicles 
on road sections. The movement of vehicles on a lane can be displayed 
as a time-space digram (fig. 2.1). The distances between vehicles can be 
studied either at a fixed location (headways) or at a fixed time (spacings) 
or as a dynamic (car-following) model, which is the most comprehensive 
approach. A short review of the car-following models gives a broader view 
on the properties of vehicle headways. 

Let the position of a vehicle j at a moment u be Xj(u). The time instant 
for the vehicle to pass the observation point (x) is: 

ujj{x) = {uj | Xj(u) = x}. (2-1) 

Thus, the time headway of vehicle j at point x is: 

tj(x) = LOj(x) — ujj- i(x). (2-2) 

Assuming constant speed for vehicle j during the headway, the headway 
of vehicle j at time u is: 


tj(u) 


Xj -iM ~Xj{u) 
Xj{u) 


(2.3) 


where the numerator is the spacing of vehicle j at time uj, and the dot 
denotes differentiation with respect to time. This equation gives the ba¬ 
sic relationship between time headways, space headways, and car-following 
models. 
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Position 



Figure 2.1: Time-space diagram of vehicle trajectories 

Let us assume that the driver of vehicle j has reaction time Rj. The 
length of the vehicle ahead is Lj—\. The driver of vehicle j wants to keep a 
net safety spacing Bj in front of the vehicle. This spacing allows the driver 
to avoid too strong decelerations. The desired headway is then (Daou 1966): 

t j = R j + Lj ~* + Bj . (2.4) 

Xj-l 

Daou analyzed over 24,000 headways from the Holland Tunnel and obtained 
least squares estimates for reaction time Rj = 1.488 s and (gross) safety 
spacing Lj_i + Bj = 35.0 ft (10.7 nr). 

According to this theory, there is a correlation between desired headways 
and speeds as shown in figure 2.2. At higher speeds the follower headways 
are shorter. 

In a car-following model the driver of a following vehicle can control the 
vehicle by accelerating and decelerating. The researchers in the General 
Motors (see May 1990) developed several models based on the assumption 
that the response (control) is a function of stimuli and sensitivity. 

According to the linear car-following model the response is a linear 
function of the speed difference between the vehicles (Chandler, Herman 
& Montroll 1958): 

Xj (lo + Rj) = cco [xj -i (w) — Xj (u;)] • (2-5) 

Because the response does not depend on the headway or the spacing be¬ 
tween vehicles, the model is unrealistic, and it cannot describe the desired 
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Headway [s] 



Figure 2.2: Speed and desired headway according to Daou (1966) 


headway. 

Gazis, Herman & Potts (1959) have proposed a nonlinear car-following 
model: 

OL\ 

<“ + Ri) = Xj _L (w) - Xj (u) [± ’-' - ** <U,)1 • (2 ' 6) 

where the sensitivity is a function of a parameter (aq) and the spacing, and 
the stimulus is the speed difference between vehicles j — 1 and j. 

The model produces different equilibrium headways for different initial 
headways and relative speeds. Long initial headways and low initial relative 
speeds result in long equilibrium headways. 

The headway of a vehicle approaching the vehicle ahead is shown in figure 
2.3. The leading vehicle has constant speed of 80 km/h. The follower starts 
the car-following at initial headway 10 s and speed 100 km/h. Reaction 
time is 1.5 seconds. Sensitivity coefficients (on) are 3.0 m/s (upper curve), 
2.6 m/s (middle curve), and 2.0 m/s (lower curve). The straight line shows 
the headway if the follower maintains constant speed (100 km/h). 

Gazis, Herman & Rothery (1961) have further modified the nonlinear 
model: 

%rn ( w 4 - A 

Xj (w + Rj) = a 2 - - 3 — - ] —T [Xj-i M - Xj M] . (2.7) 

[Xj- 1 (W) - Xj (w)J 

Increasing l makes the model less sensitive to the initial headway, because 
the response is more strongly inversely proportional to the spacing. The 
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Figure 2.3: Headway as a function of time, when a vehicle is approaching the 
vehicle ahead according to the nonlinear car-following model (2.6) 


response is rather weak until the spacing between vehicles is short. A larger 
value of m makes the steady state headway less dependent on the initial 
speed difference, because the sensitivity is proportional to the speed of the 
follower. 

On the basis of the car-following models the variation in the follower 
headways can be explained by seven factors: 

1. Differences in the sensitivity between followers 

2. Delayed responses to speed variations of the leading car 

3. Oscillation due to strong and delayed responses 

4. Variation in the length of leading vehicles 

5. Approaching (transition) phase in the beginning of the car-following 
process 

6 . Variation in the initial headway, when the car-following process starts 

7. Variation in the initial speed difference, when the car-following process 
starts. 

Factors 1-4 describe the typical car-following process, when a vehicle has 
finished the approaching phase and is following the leading vehicle. Factors 
5-7 describe the situation, when a vehicle approaches the leading vehicle 
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and shifts from the free-moving state to the car-following state. Factors 1, 
6 , and 7 describe different types of drivers. 

The car-following models have serious limitations in headway studies: 

1. Even in a reasonably dense traffic some drivers will not attempt to 
follow the car preceding them (Herman & Potts 1961), because the 
desired speed of a driver can be lower than the speed of the vehicle 
ahead. The models can only be applied to vehicles that are following 
another vehicle. 

2 . At long spacings the speed difference does not inffuence the follower. 
The models do not, however, have a threshold for car-following and 
free driving. 

3. Some short headways are caused by passing vehicles. The models do 
not, however, describe passing. 

4. The models developed by the researchers in the General Motors do not 
have a connection between speed and spacing. The equilibrium head¬ 
way is different depending on the initial speed of the follower (fig. 2.3). 
In addition, the models do not allow the possibility of differences in 
the “desired spacing” among followers. 

5. According to Leutzbach (1988) “the perceptual threshold for positive 
speed differences (the spacing decreases) is smaller than that for neg¬ 
ative speed differences (the spacing increases)”. The models have the 
same sensitivity for negative and positive speed differences. 

6 . The car-following theory does not deal with the inherent fuzziness of 
the car-following process. The follower cannot observe and does not 
continuously react to variations smaller than a perception threshold 
in the spacing or the speed of the leading car (Leutzbach 1988). This 
“acceleration noise” causes random variation in the process. 

7. A driver’s reaction pattern is imprecise, but the reaction of the follower 
is deterministic in the models (Kikuchi & Chakroborty 1992). 

These considerations have led to psycho-physical 1 (Leutzbach 1988) and 
fuzzy (Kikuchi & Chakroborty 1992) car-following models. 

1 Psycho-physiologisch in German. 
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2.2 Statistical methods 

2.2.1 Sampling techniques 

Headway (t t ) is the time between successive vehicles (i — 1 and i) as they 
pass a point on a lane or roadway measured from front bumper to front 
bumper (HCM 1994). The mean headway (t) and the traffic volume (A) in 
a sample are connected in an obvious way: 

~ 71 I 

A=^— = ?> (2.8) 

2-a=l T i 1 

where n is the number of observations in the sample. Because the expected 
traffic volume is, by definition, the ffow rate, the expected mean headway 
at ffow rate (A) is: 

E(i\\) = j. (2.9) 

In order to get generally applicable results about the properties of head¬ 
ways, it is necessary to study stationary conditions. 2 Consequently, all 
nonrandom variation must be removed from the measurements as far as 
possible. Otherwise the parameters of the distribution change as a function 
of time. 

In previous studies mainly two approaches have been used to overcome 
the problem of nonrandom variation: The samples have been collected either 
as fixed time slices or on the basis of trend analysis. Quite often the method 
of collecting and analyzing the data is not specified. 

In some studies the measurements are investigated in fixed time slices 
of length short enough to exclude any significant trend (typically 30 s to 
10 min). It is, however, possible to have a significant trend in a 5-10 minute 
time period. One minute period, on the other hand, is too short, at least 
under light flow conditions. 

As the time period (S n , sum of sample headways) gets shorter, sample 
mean (t) longer or standard deviation (s) larger, the standard error in the 


2 The stationarity is considered to be equivalent to trendlessness. It is possible that 
the process is nonstationary, although the flow rate remains constant. This possibility 
is, however, ignored, as suggested by Cox & Lewis (1966).- Because traffic flow is a 
dynamic process in time and space, stationarity can be defined both in time and in space 
(McLean 1989). Only stationarity in time is considered here. 
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mean headway estimate increases: 


s.e.[fl{T)} 



= s 



( 2 . 10 ) 


Under light flow conditions the number of observations per period is small. 
The growth of the standard deviation as the mean headway increases 3 makes 
the mean estimate even more unreliable. As Botrna (1986) has noted, the 
number of long headways in small samples is a highly fluctuating quantity. 

Because the number of headways in a short observation period is usually 
too small for testing goodness of fit, it is necessary to group samples having 
nearly equal means. This may result in a sampling error as the sample means 
have an inappropriate distribution. The artificial homogenization of the 
data leads to the narrowing of the headway distributions, and it can happen 
that the tails of the underlying distributions are undersurveyed (Heidemann 
1990). According to Greenberg (1966) the stratification of headways by flow, 
density or other criteria will introduce a truncation into the data. 

Consider m samples, each sample having size n t and mean and variance 
equal to: 


ti — / , tik 

rii 

* k =l 

fii 

^ X ^ik ti) j 


= 


rii 


k= i 


( 2 . 11 ) 

( 2 . 12 ) 


where the fcth headway in sample i. The combined sample mean is: 


t= - y 'nitj, (2.13) 

n z —' 

i=l 

where 

m 

n = y^ j n i . 

1=1 


3 See section 4.3.3. 


(2.13a) 
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The variance of the combined sample is: 


s 2 - - 
n 


( tik t 


r\2 


i= 1 k= 1 

m rii 


Tl 


^ ^ ^ ^ (^2/c ti ti t 


T \2 


2=1 fc=l 

771 r 


^ m r rij rti 

- y: n; (ii - f ) 2 + ^ (tjfc - tj) 2 + 2 (ti - f) y (tik - ii) 

2=1 ^ /c=l /c=l 


(2.14) 
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Consequently, the variance of the combined sample is the variance of sam¬ 
ple means plus the mean of sample variances. If true mean of the combined 
sample is used, the mean of sample variances can be assumed rather con¬ 
stant. The range of sample means, however, should be defined so that the 
variance of sample means is not biased. Too narrow (wide) range results in 
underestimation (overestinration) of the variance of the headways. 

By the central limit theorem, the distribution of sample means from a 
population with mean /i and variance a 2 approaches normal distribution 
with mean /i and variance cr 2 /n, when the sample size (n) is large (Stuart & 
Ord 1987). If the flow rate is changing during the time of measurement, the 
sample means between the selected lower (a) and upper ( b ) limits can be 
assumed to be uniformly distributed. The range can be selected so that the 
variance [(b — a) 2 /12] of the sample means is realistic, but the distribution 
of the sample means is still biased. 

The other approach, adopted by Dunne et al. (1968), and by Breiman 
et al. (1969), is to analyze several long time periods showing no trend. If 
the number of observations per sample is large, the estimate of the mean is 
more reliable. It is not necessary to group samples. Greater variation and 
serial correlation is allowed without a strong impact on the mean estimate. 
Longer sampling periods enable the detection of lower frequencies in the 
spectral analysis, if it is performed. There is, however, a loss of data, because 
nonstationary time periods are discarded. 

Despite the use of trend tests, there is a possibility of short deterministic 
patterns in the traffic flow. This additional component of variation may 
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cause a small increase in the variance of the headway data (Heidenrann 
1990). 

2.2.2 Trend tests 

In order to obtain trendless samples, it is necessary to subject the data to 
trend tests. Three tests are evaluated: 

1. Weighted sign test 

2. Kendall’s rank correlation test 

3. Exponential ordered scores test. 

Cox & Stuart (1955) suggested the weighted sign test as a quick and simple 
but less efficient alternative to the rank correlation test. For a pair { t t , tj \ 
i < j } of observations a score is defined: 


1, if tj. > tj 
0, if U < tj. 


The test statistic is a weighted number of points of decrease: 

l n 

S — ^ ' (u 2k T X)hk^ n —kJ r \. 

k= 1 


(2.15) 


(2.16) 


Under the null hypothesis (no trend) the observations are not correlated, 
and S is asymptotically normal (Brockwell & Davis 1987) with mean and 
variance equal to: 

h(S) = \n 2 (2.17) 

<r 2 (S) = ^ n ( n2 “ !)• ( 2 - 18 ) 

The Kendall’s rank correlation test (Kendall & Ord 1990, Stuart & Ord 
1991) extends the pairwise comparison to all pairs { t t , tj \ i < j }: 

7i—l n 

Q = ^ ^ hij. 

i=l j=i +1 


(2.19) 
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The rank correlation coefficient is: 


4 Q 

n(n — 1) 


( 2 . 20 ) 


Under the null hypothesis (no trend) r tends to normality rapidly (Stuart 
& Ord 1991) with mean and variance given by: 


Mr) = 0 


2(2n + 5) 


9n(n — 1) 


( 2 . 21 ) 

( 2 . 22 ) 


The exponential ordered scores test of Cox & Lewis (1966) attaches the 
score 

Sr,n = —h • • • H- —, r = 1,... ,71 (2.23) 

n n — r + 1 

to the rth order statistic (f( r ), the rth longest headway) in a sample of size 
n. Let s n (i) be the score of headway tj. The test statistic is: 


F = E s -w(*- n y 1 )- ( 2 - 24 ) 

The test statistic is asymptotically normally distributed with mean zero and 
variance 


*\ V ) = ±( i - r ^-) 2 K 2 , n , 

i=i ' ' 

where 

1/11 1\ 

1^2,n — 1-T ( 1-T +-H - ) • 

n — 1 \n n — 1 2/ 


(2.25) 


(2.25a) 


Empirical power curves 4 were evaluated for the three tests by a Monte 
Carlo method with shifted exponential pseudo-random variates generated 
by the following algorithm: 


Tj +1 = T - 

4 See page 36. 


A (a Jj 


t In (U) 


j = 0,1,2,..., 


(2.26) 
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where 


j 

LOj = ^ Ti 

i= 1 

U 0 = o, 


(2.26a) 

(2.26b) 


A (uij) is the current flow rate, r is the location parameter of the shifted 
exponential distribution, and U is a pseudo-random number from the uni¬ 
form distribution (0,1]. The algorithm allows negative random variates to be 
generated. Because negative vehicle headways are unrealizable, the power 
curves for negative trends (displayed on a gray background in figure 2.4) 
should be viewed for comparison only. 

Linear trend was used. Consequently, the average headway at time in¬ 
stant uij is: 

E(T j+l ) = - 1 — = f (1 + Buj), (2.27) 

where Ao is the initial traffic intensity and 9 is the trend factor. Facto¬ 
rial design (Montgomery 1984) was used with Ao = 100 veh/h, 200 veh/h, 
500 veh/h, and 1,000 veh/h, as well as 6 = -500 %/h, -200 %/h, -100 %/h, 
-50 %/h, -10 %/h, 0 %/h, 10 %/h, 50 %/h 100 %/h, 200 %/h, and 500 %/h. 
The sample sizes were 50, 100 and 200, and the number of replicas was 100. 
The two-sided significance level was 0.05. 

Figure 2.4 shows the empirical power curves of the tests based on Monte 
Carlo runs. A test is powerful if its power curve is equal to the significance 
level for trendless samples and near unity for samples with trend. The curves 
are at 0.05, when there is no trend (9 = 0), that is the tests reject 5 % of the 
samples. As the absolute value of the trend increases, the tests can detect 
the trend more efficiently. The exponential ordered scores test is superior to 
the other tests, and the weighted sign test is the least powerful. Although 
the exponential ordered scores test is computationally more strenuous, it 
was selected as the prime method of testing trend. 

According to Stuart (1956) the asymptotic relative efficiencies of the 
weighted sign test and the rank correlation test against the regression co¬ 
efficient test are 0.83 and 0.98, respectively. The empirical power curves in 
figure 2.4 confirm the superiority of the rank correlation test against the 
weighted sign test. 
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- Weighted sign = Rank correlation - Exp. ordered scores 

Figure 2.4: Empirical power curves for three trend tests 


2.2.3 Parameter estimation 
Method of moments 

The parameters of a distribution can be estimated by equating the expres¬ 
sions for the first k population moments to the corresponding expressions for 
the k sample moments, and solving the k parameters from the set of equa¬ 
tions. This method of moments has been widely used in headway studies 
because of its simplicity: Raw data is not required after the first k moments 
have been calculated, and the first moment (mean headway) is directly ob¬ 
tained from the traffic volume (see eq. 2.8). Also, the solution to the set of 
equations is usually simple. 

The method of moments estimators are usually consistent (Dudewicz 
&: Mishra 1988); i.e., the mean squared error of the estimator decreases 
to zero as the sample size increases. The method is, however, usually not 
efficient 5 (Stuart & Ord 1991), except for distributions closely resembling the 
normal (Greenwood & Durand 1960). This is to be expected, because the 
information in the data is reduced to the sample moments. Accordingly, the 
method of moments is usually not appropriate to estimate the parameters 
of skew distributions, like the headway distributions, but it can be used to 
give first approximations for more efficient methods. 

5 The estimator is (asymptotically) efficient if it has in large samples the smallest vari¬ 
ance among all consistent and asymptotically normal estimators. (Stuart & Ord 1991) 
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Maximum likelihood method 

The maximum likelihood method is the other traditional method to estimate 
the parameters (6 = 9i,...,9 k ) of a distribution. Let us assume that the 
probability density function is /( t \ 9 ). For a sample of n independent 
observations the parameter estimators ( 6 ) are obtained by maximizing the 
likelihood, function: 

n 

m = Uf{u\o). (2.28) 

i =1 

subject to parameter vector 6. In contrast to the method of moments, the 
likelihood function contains all the information in the sample. 

Usually the same estimators can be obtained more easily by finding the 
maximum of the log-likelihood function: 

n 

In L(6) = In /( U 6 ) . (2.29) 

i =1 


A necessary condition for the maximum is that the partial derivatives equate 


to zero: 

dlnL(9) Q 

89 \ 

d In L(6) _ Q 
99 k 


(2.30) 


This set of equations is called the likelihood equations. A sufficient, though 
not a necessary, condition that a solution to these equations is a local max¬ 


imum is: 


d' 2 In L{0) 
89 2 

d 2 ln L{0) 
8 % 



(2.31) 


If there are more than one maxima, the largest one of them is chosen. The 
resulting estimators (9 = 9i,...,9 k ) are the maximum likelihood estima¬ 
tors (MLEs), provided that there is no terminal maximum at the extreme 
permissible values of the parameters. (Stuart & Ord 1991) 
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The maximum likelihood method has several desirable properties. Under 
very general conditions a MLE is consistent, and under certain regularity 
conditions it is efficient. Also, if there is an efficient estimator, the maxi¬ 
mum likelihood method will produce it. Under some general conditions the 
MLE is asymptotically normally distributed with mean equal to the true pa¬ 
rameter value, and among all estimators that are asymptotically normally 
distributed, the MLE possesses the minimum asymptotic variance; i.e. a 
MLE is a best asymptotically normal (BAN) estimator. The properties of 
the MLEs for large samples are, accordingly, excellent. (Stuart & Ord 1991) 


Modified maximum likelihood method 

According to Cohen & Whitten (1988) the first order statistic (the smallest 
observation) of a sample contains more information about the location 6 pa¬ 
rameter (t = 9 1 ; threshold for the minimum headway) than any of the other 
sample observations, often more than all the other observations combined. 
This property can be used to modify the maximum likelihood method. 

The modified maximum likelihood estimators (MMLEs, 6) are obtained 
by substituting the equation of the partial derivative with respect to the 
location parameter (r) with the following equation: 

F (*(i)l®) = F niHi)) = ( 2 - 32 ) 

where is the first order statistic of the sample. The equation can also 
be written in terms of the inverse probability distribution function: 


F~ l 


1 

71+1 



(2.33) 


In some situations the modified method gives better estimators than the 
maximum likelihood method. The modified method has been used to esti¬ 
mate the parameters of the gamma and the lognormal distributions. Cohen 
& Whitten (1980) and Cohen & Whitten (1982) give additional information 
about the properties of MMLEs. 


Minimum chi-square method 

For a sample classified into m mutually exclusive and exhaustive classes, 
the minimum chi-square (MCS) estimators are obtained by minimizing the 


See page 20, footnote 3. 
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chi-square statistic 7 (Cox & Hinkley 1974): 


X 


2 


E 


[m — np( i | 6)} 2 
np( i | 6) 


(2.34) 


where n* is the number of observations in class i, n is the sample size, and 
p{ i | 6 ) is the probability of an observation falling into class i conditional 
upon parameter vector (6). 

Like the MLE, the MCS estimator is also asymptotically efficient and 
a best asymptotically normal (BAN) estimator (Stuart & Ord 1991). The 
MCS method, however, wastes some information because of the data classi¬ 
fication. 

The maximum likelihood method is considered computationally better 
and more effective in small samples than the MCS method. The classifica¬ 
tion of data is troublesome, especially in small samples. Also, the maximum 
likelihood method, unlike the MCS method, gives closed form solutions for 
some distributions. In the analysis of the headway data the maximum like¬ 
lihood method is given preference. When it fails, the modified maximum 
likelihood method is used. 8 


2.2.4 Goodness-of-fit tests 
Testing hypotheses 

Goodness-of-fit tests are used to test the hypothesis that some particular 
distribution F( t \ 6 ) is the true unknown distribution G(t), which has gen¬ 
erated the sample: 

H 0 : G(t) = F(t\0 ). (2.35) 

The null hypothesis (Hq) that the distributions are equal, is tested against 
the alternative hypothesis (Ha) that the distributions are not equal. The 
null hypothesis should be rejected if the distributions differ. 

If F(t\0) is completely specified, the null hypothesis is called simple. 
If the parameters (6) are partly or completely unspecified, Hq is called 
composite —it is composed of various simple hypotheses. In the goodness- 
of-fit tests for headway distributions, Hq is simple, while Ha is composite; 
i.e., the alternative headway distribution is unspecified. It is obvious that 
simple hypotheses are easier to test. 

7 See also page 37. 

8 For the shifted exponential distribution the modified method of moments estimators 
are used, because they are easier to calculate and give practically the same results. 
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The test fails if Hq is rejected while it is true ( type I error), or Hq is 
accepted while it is false ( type II error). The probability of type I error is 
called the significance probability 9 , and the maximum probability of rejecting 
Hq is called the significance level of the test. The probability of type II error 
is difficult to determine in headway studies, because Ha is composite. 

The result of a test is either "Hq is rejected” or "Hq is not rejected”. In 
the first case Ha is accepted, and the significance probability gives the prob¬ 
ability of an erroneous judgment. The second case is interpreted that there 
is not enough evidence to show that Hq is false. This does not necessarily 
mean that Hq is true. Accordingly, it is possible for several distributions to 
pass the goodness-of-fit test, although only one or none of them is the true 
distribution G(t). In the future larger data sets and more powerful tests 
may lead to the rejection of these hypotheses. This is typical of statistics: 
Much can be said about what is (probably 10 ) not true, but very little about 
what is true. 

The ability of a test to detect that Hq is false, is called the power of 
the test. For a composite alternative hypothesis the power depends on the 
particular (simple) models in Ha- Consequently, the power is best described 
as a curve or function, which shows the probability of rejecting Hq as a 
function of simple models in Ha- (Figure 2.4 on page 32 shows the power 
curves for three trend tests.) In an ideal test the probability of rejecting Hq 
while it is true (type I error) is nul, and the probability of rejecting it while 
it is false is unity. 

Because the true distribution G(t) behind a headway sample is unknown, 
G(t) cannot be used in the test, but F(t\0 ) must be tested against the 
properties of the sample: 


Hq : The random sample {t\,..., t n } is generated by F( t \ 6 ). (2.36) 

If the parameters of F(t) are estimated from the sample, the null hypothesis 
becomes: 


Hq : The random sample 


{ti ,..., t n } is generated by f(^ 



(2.37) 


9 Although the term “P-value” is more commonly used, the term “significance proba¬ 
bility” is used here to emphasize the fact that the significance of a test is the probability 
of type I error. 

10 Popper (1983) states this even more strictly: “Probability hypotheses do not rule out 
anything observable... Probability estimates are not falsifiable. Neither, of course, are 
they verifiable.” Probabilistic explanations are, however, important, because “often the 
best that can be established with some warrant is a statistical regularity” (Nagel 1982). 
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In both cases the problem is to find a powerful test statistic (B), which 
measures the goodness of fit. If the distribution Fb(-) of the test statistic 
under Ho is known, the significance probability (p) is the probability that a 
random variate ( B ) from distribution Fb(-) is greater than or equal to b: 

p = P{B >b} = 1 - F B (b). (2.38) 

If Ho is true, the significance probabilities are uniformly (0,1) distributed 
(Fisher 1938). This is the consequence of the probability integral transfor¬ 
mation: If X is a continuous random variate and P{X < x} = F(x), then 
F(X) is distributed according to the uniform (0,1) distribution (Johnson et 
al. 1994). 11 

If the distribution of the test statistic is the same for all distributions and 
their parameters, the statistic is called distribution-free , and the hypothesis 
is called nonparametric 12 . A test using a statistic whose distribution is 
different depending on F( t \ 6 ) is called parametric. Such a test is usually 
more powerful, because it is based on the known properties of the statistical 
model. 


Nonparametric tests 

Chi-square test If a sample follows distribution F(t\B), it can be di¬ 
vided into m exclusive and exhaustive classes A\,..., A m so that the propor¬ 
tion of observations in a class corresponds to the probability of a random 
variate from F(t\Q) falling into the same class. In headway studies an 
obvious classification is a set of consecutive headway ranges: 


{tj € Ai} -<=f- {Tj < tj < Ti- |_i} . 


(2.39) 


11 Consequently, the inverse probability distribution function can be used to generate a 
random variate A' following distribution F(x) from a uniformly [7(0,1) distributed random 
variate Y: 

x = F _ 1 (y). 

12 The terminology follows loosely that of Stuart & Ord (1991). They confine the ad¬ 
jectives “parametric” and “nonparametric” to statistical hypotheses and apply the adjec¬ 
tive “distribution-free” to the test statistic, the distribution of the test statistic, and so 
on. Often the terms “nonparametric” and “distribution-free” are used interchangeably 
(Conover 1980). 
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In this case the probability P{fj € Aj} can be calculated as follows: 
p(i\9) = P {tj £ Ai} 

= P{T* < tj < T i+ 1 } (2.40) 

= F{T i+1 \G)-F{T i \0). 


The goodness of fit can be measured by Pearson’s chi-square statistic: 

,2 = \ n i ~ n P( i I A )] 2 
" np{ i\0) 

where m is the number of classes, n t is the number of observations in class 
i, and n is the sample size. If the null hypothesis is true, n* is a binomial 
random variate with parameters n and p(i \ 9), and expectation E(m ) = 
np(i | 9). The statistic is then asymptotically (i.e., for large n) chi-square 
distributed with v = m — 1 degrees of freedom. 

If some or all of the parameters ( 9 = 9i,...,9k) are estimated from 
the sample, the degrees of freedom becomes v = m — 1 — r, where r is 
the number of estimated parameters. When more parameters are estimated 
from the sample, the fit is bound to be better, and the test statistic must 
be smaller (Lilliefors 1967). 

The significance probability p of the chi-square test is the probability 
that a random variate (x 2 ) following the chi-square distribution is greater 
than or equal to x 2 : 


p = P{x 2 > x 2 } = 1 — F x 2 ( x 2 | v ), (2.42) 

where Fy'i (■) is the probability distribution function (PDF) of the chi-square 
distribution. The asymptotic distribution of x 2 does not depend on the dis¬ 
tribution F( 1 1 9 ) tested or its parameters, but only on u. Consequently, 
asymptotically the test statistic is distribution-free, and the test is nonpara- 
metric. 

Because the test statistic is only asymptotically chi-square distributed, 
the sample size must be large enough. It has been found, that the approxi¬ 
mation is rather good, when the sample size is about four times greater than 
the number of cells, even if some of the expected frequencies [np( i \ 9 )] are 
quite small. If the cells are approximately equiprobable the sample size can 
be two times greater or even equal to the number of cells, depending on the 
significance level. (Conover 1980, Moore 1986) 
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Kolmogorov-Smirnov test For a sample {ii,... ,t n } of n observations 
the empirical distribution function (EDF) gives the proportion of observa¬ 
tions not greater than t: 

F n (t) = | ^ t(j) < t < t(j+ 1 ) | , j = 1, • • •, n. (2.43) 

If {ti,..., t n } is drawn from F(t\9 ), the EDF approaches F(t\0 ) as the 
sample size increases (Dudewicz & Mishra 1988): 

lim F n (t) = F(t\0). (2.44) 

n— t-oo 

The null hypothesis is that the distributions are similar enough for F n (t) to 
be the EDF of a random sample generated by F(t \6 ): 

H 0 :F n {t)~F{t \0). (2.45) 

It is possible to test the goodness of fit by measuring the discrepancy between 
the EDF and the proposed distribution. Such a statistic is called an EDF 
statistic (Stephens 1986a). 

The Kolmogorov-Smirnov (K-S) statistic D is the largest absolute verti¬ 
cal difference between F n (t) and F(t\ 6 ): 

D= sup \F n (t) — F( 1 1 6 )|. (2.46) 

t.£ Re+ 

Under Ho the distribution of D is known, and it is independent of F(t \ 9 ). 
D can be calculated as follows: 

D = max {D + , D~] , (2.47) 

where 

D+ V(rk>{« _F( ‘ o)|9) } (2 ' 47a) 

D ~ { F U>ID “UT"}' (2 ' 47b) 

The statistic is distribution-free, and its distribution is known. The signifi¬ 
cance probability of the test is 


p = 1 - F d { D | n) 


( 2 . 48 ) 
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where Fjj(-) is the PDF of the K-S distribution. 

The K-S test avoids the arbitrary discretizing of the distribution func¬ 
tion, and the transformation of the observations to counts. This process, 
troublesome in small samples, is required by the chi-square test. The reduc¬ 
tion of data makes the chi-square test less powerful than the K-S test (See 
Moore 1986). 

Stuart & Ord (1991) report some comparisons between the chi-square 
and the K-S tests: For large samples with equal significance level (0.05) and 
equal power (0.5), the K-S test can detect a deviation about half as small as 
the chi-square test can. The ratio declines steadily in favor of the K-S test as 
the sample size increases. On the other hand, to detect an equal deviation, 
the K-S test requires the sample size to be of order n 4 / 5 compared to n for 
the chi-square test. As the sample size increases, the relative efficiency of 
the chi-square test tends to zero. The K-S test is a very much more sensitive 
test of fit for a continuous distribution. It is also asymptotically much more 
efficient. 

The K-S test does not have a nonparanretric modification for the case 
that the parameters are estimated from the sample. The alternative is a 
parametric test. 


Parametric tests 


Parametric Kolmogorov-Smirnov test If the parameters of a theoret¬ 
ical distribution must be estimated from the sample, the null hypothesis 
is: 


H 0 : F n (t) ~F(t 



(2.49) 


The K-S test statistic is now: 


D(6) 


sup 

teRe+ 


Fjt) - F( t 


(2.50) 


Because fitting the parameters makes it possible to adjust the tested 
distribution more efficiently than by using predetermined parameters, the 
critical values for a given significance probability must be smaller. The test 
statistic is no longer distribution free, but different critical values relate to 
different null hypotheses (Green &; Hegazy 1976). The test depends on the 
distribution tested, the parameters estimated, the method of estimation, 
and the sample size (Stephens 1986a). 

Although the distribution of D(9) is unknown, the significance proba¬ 
bility can be evaluated by a Monte Carlo method (see algorithm 2.1). For 
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Step 1 Estimate sample parameters 9. 

Step 2 Calculate test statistic D n (9 ) for the sample. 

Step 3 Set i = 1 and k = 0. 

Step 4 Generate random replica (i) with distribution F^t 

Step 5 Estimate the parameters 9{ of replica i. 

Step 6 Calculate test statistic Di(9i) for replica i. 

Step 7 If Di(9i ) > D(9 ) set k = k + 1. 

Step 8 If ?' < to, set i = i + 1, and go to step 4. Else calculate 

(* + l) 


9 ) and size n. 


Pm = 


(to + 1) 


and stop. 


Algorithm 2.1: A Monte Carlo method for testing goodness of fit 


each sample, to random replicas are generated from El t 



For each 


replica (i), the test statistic [E*(#?:)] is calculated, and the number of repli¬ 
cas ( k ) having test statistic greater than or equal to [D(9)] is counted. The 
estimator for the significance probability is: 


k + 1 

Pm = : 7 

TO + 1 


(2.51) 


where 


k = #{i\D i (9 i )>D{9)}. (2.51a) 

Noreen (1989) has shown the validity of this test, and presented confi¬ 
dence intervals for the method. Assuming that all p m are a priori equally 
likely (Ho is assumed to be true), the probability that the true significance 
probability p is less than or equal to p u is: 


P{ p < p u | k, TO } 


f 0 Pu p k ( 1 - p) m ~ k dp 
fo P k i 1 ~ p) m ~ k dp 


(2.52) 
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The upper confidence limit p u ^ a at confidence level a can be obtained by 
solving the equation: 

P{ P < Pu,a | k, m } = a. (2.53) 

For P 9999 = 0.05 the 95 % upper confidence limit is 0.059, for pggg = 0.05 
the limit is 0.062, for P 499 = 0.05 it is 0.067, and for pgg = 0.05 it is 0.089. 

The results of the parametric K-S tests for the exponential distribution 
were compared with tables available in the statistical literature (Stephens 
1986a, Durbin 1975). There were no significant differences. 

Lilliefors (1967) compared the results of nonparametric and parametric 
K-S tests for the normal distribution: The 1 % critical values of the para¬ 
metric test were approximately the same as the 20 % critical values of the 
nonparametric test. For the exponential distribution the percentages were 
5 and 20, respectively (Lilliefors 1969). Accordingly, the nonparametric K- 
S test is much too conservative if the parameters are estimated from the 
sample. 


Other parametric tests The K-S statistic measures the largest absolute 
difference between the EDF and the proposed model. Typically the largest 
differences are in the middle region of the distribution. Near the tails the 
absolute differences are smaller, but relative differences may be quite large. 
In this sense, the K-S statistic gives less weight to the tails. The statistic 
can be improved using a quadratic measure (Stephens 1986a): 

POO 

Q = n [F n (t) - F(t)} 2 ^[F(t)} dF(t ), (2.54) 

Jo 

where ip[F(t )] is a weight function. If ip[F(t)] = 1, the statistic is the 
Cramer-von Mises statistic (IF 2 ), which can be calculated as follows (Green 
& Hegazy 1976): 


^ 2 = E 
3= 1 




n 



(2.55) 


Anderson & Darling (1952) suggested ip[F( t)] = {F(t) [1—F(t )]} -1 as the 
weight function. Because this is the reciprocal of the variance of y/n[F n (t) — 
F(f)], in a certain sense, the statistic assigns equal weight to each point of 
the distribution F(t). The Anderson-Darling (A-D) statistic (A 2 ) can be 
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calculated as (Green & Hegazy 1976): 


A 2 = -n- 2 ^ 


3 =1 


n 


In F t 


Hi) 


e j + In (1 - Fl t {n _ j+ 1 ) 


(2.56) 

The A-D test will fail if there are observations outside the range of 
the distribution. If even a single F(tj ) is 0 or 1, the statistic grows to 
infinity, leading to the rejection of Hq. Consequently, the location parameter 
(threshold for minimum headway) should always be estimated smaller than 
the shortest headway in the sample. 

Studies of these test have suggested that the parametric EDF tests are 
more powerful than the chi-square test. Among the parametric EDF tests, 
the K-S test is the least powerful, and the A-D test slightly better than the 
Cramer-von Mises. Stephens (1986 a) suggests the A-D test as a omnibus 
test statistic for EDF tests with unknown parameters. 

Based on these results, the goodness-of-fit tests for the headway data 
have been performed by the A-D method. The test procedure followed 
algorithm 2.1 with 9,999 replicas . 13 (In the tests for the semi-Poisson dis¬ 
tribution the number of replicas was, however, 500.) The nonparametric 
K-S tests were calculated for comparison. Although these results could not 
be used to test Hq, they gave a measure of the closeness of fit comparable 
between different models. The comparison of the nonparametric K-S and 
parametric A-D tests also revealed the difference in the power of these tests. 


2.2.5 Combination of probabilities 

In the analysis of multi-sample data, it is desirable to find a single measure 
to describe the overall acceptability of the null hypothesis. The problem can 
be described by an example: 

Example 2.1 The balance of a dice is tested. The null hypothesis is that the dice 
is balanced, and each outcome (1, 2, 3, 4, 5, and 6) has equal probability (g). The 
alternative hypothesis is that the dice favors large numbers. The dice is thrown five 
times, and the result is {5, 6, 6, 5, 61. The probability for this or a better result is 

(ifZUG) = 0 . 002 . 

It should be noted that the average probability (0.233) is not a proper 
measure of significance. Another interesting feature in the example is that 

13 The parameter estimation and the goodness-of-fit tests were performed by a computer 
program calied Testfit. It is coded in C language, and it calls IMSL Fortran subroutines. 
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none of the individual test probabilities pi is statistically significant, while 
the combined probability is highly significant. 

In the example the probabilities were multiplied. An alternative method 
is to operate with the sum of the logarithmic probabilities (ln/i,). The latter 
method will now be followed, as suggested by Fisher (1938) and Stephens 
(19866). 

In the study of headways, the overall null hypothesis (Hq) can be com¬ 
posite, such as: 

Hq : The headway data is generated by F( t \ ■ ) . (2-57) 


The hypothesis is composite, because the parameters of the distribution are 
unspecified. For each sample, the parameters are estimated, and the null 
hypothesis is simple: 


: The headway sample i is generated by F[ t 


e 


(2.58) 


H 0 is true if and only if Hoi is true for all samples. 

If the null hypothesis (Hoi) is true for all samples, and the tests are in¬ 
dependent, then the significance probabilities (pi) are uniformly distributed 
between 0 and 1 [U (0,1)]. If k tests are made, the probabilities p±, ■ ■ ■ ,Pk are 
a random sample from C/(0,1). This can be tested by a method presented 
by Fisher (1938). 

For the chi-square distribution with two degrees of freedom it holds: 

p = F x 2 ( x | v = 2 ) = e~2 x . (2.59) 

Taking the natural logarithm of the probability p and multiplying it by -2 
gives the equivalent value of x for two degrees of freedom: 

x = F~ 2 1 (p | v = 2) = — 2 hip. (2.60) 

If p is a uniform (0,1) random variate, then x follows the chi-square distri¬ 
bution with two degrees of freedom. 

The sum of m variates following the chi-square distribution with two 
degrees of freedom is itself chi-square distributed with 2m degrees of free¬ 
dom. If Ho is true, and the samples are independent, then the significance 
probabilities p % are C/(0,1), and the statistic 

m 

z = -2^ In Pi 
i= 1 


(2.61) 
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has chi-square distribution with 2 m degrees of freedom (Stephens 19866). 
The significance (P) of the combined test is now the probability that a ran¬ 
dom variate (Z) having chi-square distribution with 2m degrees of freedom, 
is greater than or equal to z: 

P = V{Z >z} = 1-F x 2(z\ v = 2m). (2.62) 

This method can be used to obtain a single significance probability for a 
multi-sample data set. 

2.2.6 Moving probability 

Sometimes it is desirable to know, whether a null hypothesis should be 
rejected in some conditions but not rejected in other conditions. It is possible 
to divide the samples into classes according to some factor and to calculate 
the combined probability for each group. The classification of the data is 
not, however, always obvious. For example, there is no obvious way to 
classify samples by traffic volume. 

Let us assume that a set of probabilities pi is arranged in ascending 
order according to the traffic volume. The combined probability for the 
first k samples can be calculated as described above. It describes the com¬ 
bined significance probability corresponding to the traffic volume equal to 
the mean of the first k sample volumes. Next the second sample is selected 
as a starting point, and the combined probability for k consecutive sam¬ 
ples and the corresponding mean volume is calculated. This procedure is 
repeated until the combined probability is calculated for the last k samples. 
This series of combined probabilities is called the moving probability. This 
method, suggested by Luttinen (1992), smoothes the variation of the signif¬ 
icance probabilities and increases the power of the tests by combining the 
significance probabilities for several samples. 

If the samples are arranged in an ascending order according to the traffic 
volume, the Appoint moving probability is: 

Pk(^k,j) = 1 - F x 2 ( z k,j 1^ = 2 k), (2.63) 

where 

j+k-l 

z k,j = —2 £ In (2.63a) 

i=3 
1 j+k-l 

^ k >j = y, 

i=j 


j + k — 1 < m, 


(2.63b) 



46 


CHAPTER 2. THEORETICAL BACKGROUND 


A i is the traffic volume of sample i, Xf : j is the traffic volume of the jth term, 
and m is the total number of samples. The traffic volume of term j is the 
mean of the sample volumes in the term. 

A balance must be found between sensitivity and power. A large number 
of points ( k ) in each term makes the curve smoother and increases the 
power of each combined significance, but the curve is also less sensitive to 
deterministic variations. A small number of points makes the curve more 
sensitive and ragged, but the power of each combined significance decreases. 

In a moving average the number of points should be selected as the lowest 
value producing a reasonably smooth curve. For a moving probability the 
situation is different: Assume that all tests give significance probabilities 
near 0.2. Even a small number of points produces a smooth curve, but the 
result is not realistic, because the combined probabilities are too large. (For 
a 5-point moving probability the values are near 0.1, while a 10-point moving 
probability is near 0.04.) Consequently, the number of points k in each term 
should be as large as possible, but such that the moving probability still 
covers a sufficiently wide range of traffic volumes, and that it is sensitive to 
local variations. 



Chapter 3 


Data collection 


3.1 Measurements 

The purpose of the field studies was to examine the time headways in freely 
flowing traffic on two-lane two-way roads. All the road sections selected 
were reasonably level, straight and with no nearby traffic signals. 

Arrival characteristics were measured by a traffic analyzer. The data 
were collected by the staff of the Laboratory of Transportation Engineering 
at the Helsinki University of Technology. Using two inductive loops in both 
directions (fig. 3.1), the analyzer recorded for each passing vehicle its se¬ 
rial number, gross time headway (time interval from front bumper to front 
bumper in units of 1/100 s), net time headway (time interval from back 
bumper to front bumper in units of 1/100 s), speed (in units of 1 km/h) and 
length (in units of 1/10 nr). The data were transmitted to a computer. The 
headways on both lanes were measured and analyzed separately. 

The measurements were performed during the summer of 1988 in south¬ 
ern Finland on 9 locations having speed limits 50, 60, and 70 km/h. These 
roads are called “low speed roads”. The locations and the measurement 
times were selected so that the expected flow rates were preferably below 
1,000 veh/h. The weather was mostly dry, but two measurements failed 
because of rain or detector malfunction. 

Also some data from year 1984 (Pursula & Sainio 1985) measured on 10 
locations with speed limits 80 and 100 km/h (“high speed roads”) have been 
used. These measurements were concentrated on higher volumes, because 
capacity was a major aspect of that study. Consequently, the opposite 
volumes tend also to be higher than on the low speed roads. The data 
have been previously analyzed by Pursula & Sainio (1985) and by Pursula 


47 



48 


CHAPTER 3. DATA COLLECTION 



Figure 3.1: Inductive loops of the traffic analyzer 


& Enberg (1991). In all of these measurements more than 73,000 headways 
were recorded. 

In addition to the speed limits and the volumes, the road categories have 
other differences. The speed limit describes also the road conditions, most 
importantly the road geometry and the intersection density. High speed 
roads have usually a better geometry. Lower speed limits (50 and 60 km/h, 
in particular) are typically located near densely populated areas, where the 
intersection density is high. 

3.2 Preliminary data analysis 

The data were checked for consistency. The measured speeds (km/h) were 
corrected on the basis of radar measurements. The corrected speed (u c ) was 
calculated from the measured speed ( v m ) using the relation: 

v c = 4.77 + 0.911u m . (3.1) 

Measuring periods having more than one percent passing vehicles, were 
discarded. Also, the first observation (time from the beginning of the mea¬ 
surement to the arrival of the first vehicle) and passing vehicles (crossing the 
detectors to the wrong direction) were removed from the data. Accordingly, 
the sampling was synchronous (see Luttinen 1990). 

Detection and elimination of outliers was done during the trend analysis 
on the basis of graphical displays of headways and speeds. The outliers were 
























3.3. Selection of trend free samples 


49 


quite apparent. 


3.3 Selection of trend free samples 

In order to obtain samples with stationary data, trend analysis was per¬ 
formed by a computer program called Trendana 1 showing graphically each 
headway, the 15-point moving average, the cumulative vehicle count and the 
speed of each vehicle. 

The data were analyzed sequentially using the exponential ordered scores 
trend test (section 2.2.2). The sample size was incremented by 50 vehicles 
until the test reported trend at 5 % level of significance. The sample was 
then decremented until the level of significance for trend was near 50 % or 
at least between 30 and 70 %, the sample size was greater than 100, and the 
sampling period was between 5 and 40 minutes. The 40 minute limit was 
set in order to keep the test sensitive to local trends, but under low flow 
conditions the sample size or the period length condition had to be relaxed 
sometimes. 

If a satisfactory sample was not found, the first observations in the sam¬ 
ple that seemed to cause the trend were removed, and the process was re¬ 
peated. Also, periods with low speeds indicating disturbances were excluded, 
and the process was repeated starting after the excluded observations. Sam¬ 
ples with more than 10 % heavy vehicles (longer than 6 nr) were excluded 
from further analysis. 


3.4 Description of selected samples 

The preliminary analysis and the trend tests produced 65 samples consisting 
of 16,780 observations. The samples represent speed limits 50, 60, 70, 80 and 
100 km/h (fig. 3.2). The per lane flow rates vary from 150 to 1,840 veh/h 
(figs. 3.3 and 3.4). The basic properties of the samples are listed in appendix 
A. 

The observations from high speed roads were gathered in 1984. Because 
that study was more focused on the capacity and level of service issues, 
the observations concentrate on high volumes. On low speed roads the 
observations are more concentrated on low volumes. Consequently, speed 
limits and volumes are correlated. The high speed roads also have a higher 
overall standard and more opposing traffic. Hence, care must be taken 


1 The program is coded in C language, and it calls IMSL Fortran subroutines. 
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Number of samples 



Speed limit [km/h] 

Figure 3.2: Number of samples by speed limit 


Number of samples 



Volume [veh/h] 


Figure 3.3: Number of samples by volume 
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Figure 3.4: Sample volumes by speed limit 
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in conclusions about the influence of the speed limit and the volume on 
headways. 

From low speed (50-70 km/h) roads there are no observations with vol¬ 
ume range 1,000-1,250 veh/h. From high speed (80-100 km/h) roads the 
observations are missing for volume range about 500-700 veh/h. From roads 
having speed limit 70 km/h there are only low volume (< 500 veh/h) sam¬ 
ples. In the whole data set, there is only one sample having volume greater 
than 1,500 veh/h. 

Figure 3.5 shows the volume and the space mean speed (calculated as 
harmonic mean of spot speeds) of each sample. Because the samples are from 
different locations, no curve fitting has been done. The data, however, shows 
the decrease of mean speed with increasing volume. Except for two samples 
(12 and 33), the figure does not reveal any congestion or disturbances in the 
flow. 

The low mean speed of sample 12 indicates congestion. Sample 33 has a 
low mean speed relative to flow rate (361 veh/h). Further analysis showed 
that the sample also has a very high variation in relative speeds of leading 
vehicles. This indicates some kind of a disturbance in the flow. 

After the exclusion of samples 12 and 33, the final data set consists of 63 
samples having 16,417 observations. Accordingly, less than 25 % of the total 
measurements were included in the analysis. These data describe traffic flow 
on two-lane two-way roads under stationary conditions with not more than 
10 % heavy vehicles and very few passing vehicles. 
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Mean speed [km/h] 



Figure 3.5: Volume-speed diagram 











Chapter 4 


Identification 


4.1 The purpose of identification 


A headway (T) describes the time interval between consecutive vehicles in 
traffic flow. For a traffic engineer it is interesting to examine the shape and 
structure of the headway distribution. The differences between follower and 
free flowing headways are of special interest. In capacity and level-of-service 
studies, the proportions of short and long headways deserve attention. A 
simulation designer is interested in the possible correlation between consec¬ 
utive headways. 

Identification is the process of finding, from a set of models, the best 
theoretical model to describe the data. It can be a stochastic model or, 
as here, a probability distribution. An effective description of the most 
important properties of the data is essential in order to find a theoretical 
model which has properties similar to the data. 

This chapter discusses the shape of the headway distribution, measures 
of location and dispersion, and tests for the renewal hypothesis, which are 
concerned with the autocorrelation of consecutive headways. Some of the 
methods (e.g., the hazard function and the KS 2 -chart) have not been used 
in headway studies—at least not widely. For some commonly used methods, 
more efficient procedures are presented. The concluding section presents a 
four stage identification process, which is suggested as a standard procedure 
in headway studies. 
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4.2 Shape of the distribution 

4.2.1 Empirical distribution function 

The probability distribution function (PDF) defines the probability that a 
headway random variate (T) is not greater than some given value t: 

F{t) = P{T <t}. (4.1) 


The PDF which has generated a sample of size n can be estimated by the em¬ 
pirical distribution function (EDF). It gives the proportion of observations 
less than or equal to t in the sample: 


F n (t) 



%) — * < %+i) 


3 = 1,- •• ,n- 


(4.2) 


The EDF has a central role in the goodness-of-fit tests (section 2.2.4). 

Figure 4.1 shows contour plots of empirical headway distribution func¬ 
tions. Separate plots for low speed (50-70 km/h) and high speed (80- 
100 km/h) roads were produced by smoothing the sample EDFs. Because 
the traffic volume of only three (low speed) samples is below 200 veh/h, the 
low volume contour lines are not very reliable. 

The proportion of extremely short headways is small, but between one 
and three seconds the EDF rises steeply. On high speed roads the concen¬ 
tration on short headways is more substantial than on low speed roads. At 
500 veh/h the low speed roads have about 30 % of headways shorter than 
two seconds. At the same volume the high speed roads have about 40 % of 
headways shorter than two seconds, and the proportion of short headways is 
rather constant at moderate to high flow rates. The 0.9 contours are almost 
identical in both road categories. 

At high volumes the contour lines are very densely concentrated at short 
headways, but they curve toward longer headways at lower volumes, thus 
indicating a greater proportion of long headways. This is only natural, con¬ 
sidering that traffic volume is the reciprocal of the mean headway. According 
to May (1990), the percentage of vehicles having headways less than or equal 
to the mean headway is rather constant, about 67 %. This approximation 
holds for the low speed data. 

In HCM (1985) the percent time delay is used as a level of service mea¬ 
sure. For practical purposes, the percent time delayed is approximated by 
the percentage of vehicles traveling in platoons (t < 5). The level of service 
is accordingly based directly on the empirical headway distribution. In fig¬ 
ure 4.2 the proportion of vehicles having headways less than five seconds, 
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Volume [veh/h] Speed limit: 50-70 km/h Volume [veh/h] Speed limit: 80-100 km/h 



Figure 4.1: Contour plots of empirical headway distribution functions 


is plotted against the traffic volume. Except for two low volume samples 
(45 and 46), the proportion is larger than in a totally random, or Poisson, 
process (negative exponential headways). The proportion is also larger on 
high speed roads than on low speed roads. 

It should be noted that the data in figure 4.2 are from several sites. 
Thus, the variation is partly caused by local conditions. However, the figure 
shows that the data are consistent in respect of the level of service. 

Figure 4.3 shows the proportion of headways less than or equal to one 
second. The proportion is 0.11 for high speed roads and 0.062 for low speed 
roads. There is no significant correlation with the traffic volume. For all 
but the lowest volumes, the proportion is lower than in a Poisson process. 

Enberg Sz Pursula (1992) obtained similar results for Finnish high-class 
two-lane roads. 1 The data did not indicate a significant correlation between 
very short (t < 1 s) headways and traffic volume. There is, however, some 
evidence that on Finnish freeways the proportion of headways less than one 
second increases with increasing traffic volume (Salonen 1982). 

1 These roads have a very high standard of alignment and grade-separated intersections. 
The speed limit is 100 km/h, and only fast motor vehicles are allowed. 
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F(5) 



Figure 4.2: Proportion of headways less than five seconds 


F(l) 



Figure 4.3: Proportion of headways less than or equal to one second 
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4.2.2 Empirical density function 

The probability distribution function (PDF) can be defined in terms of the 
probability density function (pdf): 

F(t) = [ f(x) dx. (4.3) 

Jo 

Thus, the pdf is the derivative of the PDF, and: 

x ) dx = 1. (4.4) 

Both the PDF and the pdf have the same and complete information about 
the distribution, but the pdf is visually more informative. Because of this, 
the plotting of the pdf estimates (empirical density functions, edf’s) is a 
standard procedure in headway studies. 

The density function is usually estimated by the histogram method. 
Starting from an origin (to), the range of the data is partitioned into bins 
(class intervals) having the same width ( h ), which is also called the window 
width. The height of the histogram at point t is the number of observations 
belonging to the same bin as t divided by the number of total observations 
(n) and the bin width (h): 

fn(t ) = ^ #{i I [*o + jh < t < to + (j + 1 )h] 

A [to + jh < ti < to + (j + 1 )h) } 

% = 1, ..., n and j = 0,1,... (4.5) 

The total area under the histogram is unity. Once the origin and the bin 
width are selected, the method is very straightforward, and there are many 
programs to automate the process. 

The histogram method has two major drawbacks: Firstly, the location 
of the origin (to) and the bin width ( h ) can have a considerable effect on 
the shape of the estimate, especially at low values of t. Secondly, the dis¬ 
continuity of the histograms causes even more difficulties if the derivatives 
of the density estimate are needed, or if the density estimate is needed as 
an intermediate component for other methods. (Silverman 1986) 

The histogram method calculates the frequency of observations in a win¬ 
dow of width h. The window is first placed at the origin, and then shifted to 
the right by its own width. Each observation in the window is given equal 
weight. Better estimates are, however, obtained if the the window slides 
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smoothly along the x-axis, and the observations are weighted by an appro¬ 
priate smoothing function (kernel). The observations in the middle of the 
window should have greater weight, while the observations near the sides of 
the window should be given less weight. This procedure is called the kernel 
method. 

The kernel method produces a continuous estimate of the pdf, avoiding 
the pitfalls of the histogram method, but some noise in the tail cannot be 
avoided without losing essential details. The kernel density estimator with 
kernel K(-) at point t is defined as: 

w = *± K (nr)- <«> 

2—1 X 7 


Too small values of h produce noisy estimates. On the other hand, too large 
values smooth away essential details of the density. 

For headway data, the estimate should reproduce the sharp peak of the 
pdf without too much noise in the tail. As the density estimates were further 
smoothed in order to produce the surface plots (figures 4.4 and 4.5), a small 
smoothing parameter (h = 0.3) was selected. This estimator was able to 
produce the sharp peak of the density, even though there was some noise in 
the tail. 

The estimates are based on the Epanechnikov kernel (Silverman 1986): 


K(x) 



(1 - b 2 ) > 


if |x| < y/E 
otherwise. 


(4.7) 


It is the most efficient kernel function, although the differences between the 
best kernels are very small (Silverman 1986). 

Figures 4.4 and 4.5 show surface plots of headway density estimates. 
The surfaces were smoothed by a polynomial fit for each t in the grid. The 
method produces some wavelike nonsmoothness in the t direction of the 
surface, but the shape of the density functions is very clear. The method of 
distance weighted least squares was also tried. The surfaces were smoother, 
but the peak was lower, and it had saddle points in the regions of missing 
observations (see Luttinen 1994). 

The density is unimodal, bell-shaped and skewed to the right. The pro¬ 
portion of headways less than one second is small. The mode is between 
one and two seconds, after which the frequencies diminish in an exponential 
way. The most striking difference between the figures is that under low vol¬ 
ume conditions the frequency of short headways is much lower on low speed 
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Volume [veh/h] 


Headway [s] 


Figure 4.4: Empirical headway density function on low speed (50-70 km/h) roads 



Headway [s] 

Figure 4.5: Empirical headway density function on high speed (80-100 km/h) 
roads 
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roads. As the volume increases, the frequency of short headways, of course, 
increases on both road categories. 

The difference between the two road categories may be caused by greater 
variation in speed differences on high speed roads (fig. 5.30, page 133), dif¬ 
ferent road conditions, and differences in the opposite flow level. The low 
speed (50-70 km/h) data were collected near densely populated areas, where 
intersection density was higher than on high speed (80-100 km/h) roads. 

The edf’s have some similarity with the pdf’s of the gamma and the 
lognormal distributions. Also, several mixed distributions have similar den¬ 
sities. The densities of the negative exponential and the shifted exponential 
distributions are, however, different. These distributions have the great¬ 
est density at the shortest headways, whereas the edf increases smoothly 
from zero to the mode. Consequently, the exponential distributions are not 
appropriate models for short headways. 


4.2.3 Empirical hazard function 

The hazard function is widely used in reliability and life data analysis. It 
gives the instantaneous failure rate for a system that has not failed until t. 
The hazard function for a random variable T is defined as: 


, . , P{f < T < t + At If < T } 

hit) = lim --- 

v ' Ai^O At 

m 

1 -F(ty 


(4.8) 


The pdf and the PDF can be expressed in terms of the hazard function as 
follows: 


f{t ) = h(t ) exp J h[u ) dii'j (4.9) 

F(t) = 1 — exp J h{u ) du ^ . (4.10) 

The hazard function also has the complete information of the distribution. If 
the hazard function has a simple shape, it is possible to form a hazard-based 
model using the equations above. 

In terms of traffic flow, the hazard function h(t) is the instantaneous 
termination rate of headways conditional upon lasting to time t. Besides 
of its theoretical significance, the hazard function shows properties of the 
distribution that are not so clearly seen in either the pdf or the PDF. 
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A crude estimate of the hazard function can be quickly calculated by 
classifying the data into intervals of length a, and calculating: 


h n {t) = 


#{tj 1 t < tj < t + a} 
~H~ {tj \t < tj } 


i,j = 1 


(4.11) 


Better estimates can, however, be obtained by the kernel method. The 
smoothed empirical hazard function (ehf) of the headway data was calcu¬ 
lated using a biweight kernel (IMSL 1989): 


K(x) 



(i-.* 2 r 


if |x| < 1 
otherwise. 


The smoothed kernel is: 


Ks{t £(i)) 


J- K , 

ad ik \ Pd ik J 


(4.12) 


(4.13) 


where a, (3, and k are smoothing parameters, dik is the distance to the kth 
nearest “failure” from tu\, and tu\ is the zth order statistic of the sample. 
The smoothing parameters were estimated by a modified maximum likeli¬ 
hood method. 2 Consequently, the ehf was calculated as follows: 


K{t) = J2 n _ ] + i Ks (* ~ *(<))» ( 4 ‘ 14 ) 

1=1 


where n is the sample size. 

Figures 4.6 and 4.7 show surface plots of the empirical hazard functions. 
The surfaces are smoothed by making a polynomial fit for each t in the grid. 
The method produced a little nonsmoothness in the t direction, but the 
shape is obvious. The method of distance weighted least squares was also 
tried. It produced smoother surfaces, but the peak had saddle points in the 
regions of no observations. 

The figures have some common properties: The peak is between 1.5 and 
2 seconds. It moves slightly to the right, as the volume increases. The peak 
is low at low volumes, and increases as the volume increases. After the peak, 
the ehf decreases, until it reaches a constant level. This level is higher at 
high volumes. 

2 IMSL subroutines DHAZEZ and DHAZST were used to estimate the parameters and 
to calculate the empirical hazard functions. 
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Figure 4.6: Empirical headway hazard functions for low speed (50-70 km/h) roads 
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Figure 4.7: Empirical headway hazard functions for high speed (80-100 km/h) 
roads 
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At low volumes the peak is higher on high speed roads. When the 
volume on low speed roads is greater than about 1,000 veh/h, the peak rises 
very steeply. There are, however, only two low speed samples for volumes 
higher than 1,000 veh/h. The extremely high peak is caused by one small 
(n = 150) sample (no. 52), which has h( 2.0) = 1.8. Thus, the high peak 
should be viewed with some skepticism. Although the ehf in figure 4.6 is 
nonsmooth at high volumes, the basic shape is obvious. 

The hazard function of the exponential distribution is constant, and equal 
to the flow rate. The exponential tendency of low volume traffic is evident 
on low speed roads; the ehf becomes more level as the volume decreases. 

If the tail of the distribution is exponential, the hazard function reaches 
a constant value at long headways. This value is proportional to the flow 
rate. Constant ehf values at large headways are accordingly an indication 
of an exponential tail in the headway distribution. 

The hazard rate of the gamma distribution is monotone decreasing from 
infinity if the shape parameter (a) is less than unity, monotone increasing 
from zero if a is greater than unity, and in both cases approaches a constant 
value (scale parameter /3) as t becomes large (Kalbfleisch & Prentice 1980). 
If the shape parameter is a = 1, the distribution is exponential, and the 
hazard function remains constant [h(t] = /?]. This kind of shape is seen 
at the lowest volumes on low speed roads, except for the low peak in the 
ehf. This suggests that at low volumes the headway distribution resembles 
a gamma distribution, and the shape parameter (a) approaches unity from 
above, as the flow rate decreases to zero. Because h( 0) = 0 for a > 1, the 
gamma distributions do not produce extremely short headways, and because 
h(t) approaches Past becomes large, the tail of the distributions approaches 
exponentiality. 

The lognormal hazard function is zero at t = 0, increases to maxi¬ 
mum, and then decreases, gradually approaching zero at long headways 
(Kalbfleisch & Prentice 1980). The basic shape is similar to the ehf, but the 
fall after the peak is more gradual. 

The hazard function of the semi-Poisson distribution is similar to the 
headway hazard function estimate. The model in figure 5.49 (page 160) has 
gamma distributed short headways and exponential tail. 

In conclusion, the ehf suggests a mixed distribution with an exponential 
tail as a theoretical headway distribution. The lognormal distribution has 
desirable properties at short headways only. The gamma distribution could 
be considered for low volumes. 
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4.3 Measures of location and dispersion 

4.3.1 Mode and peak height 

The mode (Mo) is the point where the density reaches its maximum. So, it 
is an approximation of the most frequent value in the distribution. Summala 
& Vierimaa (1980) describe the mode as an approximation of the headway 
that most drivers select when they are following the vehicle ahead. As such, 
the mode is a measure of typical platoon behavior and a crude estimate of 
highway capacity. 

The peak height (. Ph) is a measure of the frequency at mode. For uni- 
rnodal distributions the mode and the peak height can be defined as: 

Mo = {t € Re + I f(t) = max f(x)} (4.15) 

a:SRe+ 

Ph = f(Mo). (4.16) 

The mode and the peak height of the headway data were estimated using 
kernel estimates for density. The smoothing parameter of the Epanechnikov 
kernel (4.7) was h = 0.2. A low smoothing parameter was used, because the 
density was estimated only at the range where the observations were most 
densely concentrated. Consequently, the peak height obtained was higher 
than in the density estimates. The golden section search (Gill, Murray & 
Wright 1981) with absolute precision of 0.01 was used to find the maximum 
of the edf. 

The mode values are shown in figure 4.8. There is no significant cor¬ 
relation with flow rate. The mode is about 1.53 (with standard deviation 
of 0.29) on low speed roads and 1.41 (with standard deviation of 0.22) on 
high speed roads. The difference is, however, not statistically significant at 
5 % risk level (P = 0.08), according to the t-test. Thus, the mode is a little 
below 1.5 (95 % confidence region is 1.46-1.48) at both high and low speed 
roads. 

Summala & Vierimaa (1980) measured the modes of headway distribu¬ 
tions on two-way two-lane roads with speed limits 50-100 km/h. The mode 
varied between 1.0 and 1.9, but it was rather constant at all traffic volumes. 
Rajalin & Hassel (1992) studied net time interval (gap) distributions on 
two-way two-lane roads with speed limits 80 km/h and 100 km/h. They 
reported a mode of 0.8 s at all traffic volumes. 

The peak heights are shown in figure 4.9. The peak rises as the flow 
increases. On high speed roads (speed limit 80-100 km/h) the peak is higher 
than on low speed roads, especially at low volumes. At high volumes the 
speed limit looses its significance. 
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Figure 4.8: Mode of empirical headway distributions 
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Figure 4.9: Peak height of empirical headway distributions 
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A regression analysis was performed by means of a logarithmic regression 
[a + Mn(A)] for low speed roads and a power regression (a\ b ) for high speed 
roads. The fit is reasonable: R 2 = 0.737 on low speed roads and R 2 = 0.620 
on high speed roads. The equality of residual means between high speed 
and low speed roads was tested using the t-test. The hypothesis should 
be rejected on risk level P = 0.002. The exclusion of the two very low 
(< 0.05) peak values (samples 45 and 46) makes the result statistically more 
significant (P = 0.001). Similar result (P = 0.007) was obtained using 
analysis of covariance (ANCOVA). Accordingly, separate models should be 
presented for lower and upper speed levels. 

Enberg & Pursula (1991) reported only small differences in peak heights 
on high-class two-lane rural roads in Finland. The sampling period was 15 
minutes. The mode was in the interval 1-2 s. When 60 minute samples 
were used, the increase in the peak height was more evident (Enberg & 
Pursula 1992). 


4.3.2 Median 

The median ( Md) of a continuous random variate is the 50th percentile of 
the distribution: 

P{T < Md} = 0.5. (4.17) 

The sample median is a value such that half of the sample observations are 
to the left and half to the right of it: 


Md 


trfy, if n = 2k — 1 

\ {t(k) + t(k+ 1 )) s if n = 2k. 


(4.18) 


where is the £:th order statistic of the sample, and n is the sample size. 
Sample medians of the headway data are shown in figure 4.10. The median 
on high speed roads is surprisingly level ( R 2 = 0.462), but there are only 
few low volume samples. On low speed roads the volume and the median 
are more clearly correlated ( R 2 = 0.855). 

For symmetrical unimodal distributions the mean, the median and the 
mode coincide. It is a known property of nonsymmetric unimodal distri¬ 
butions that the mean, the median, and the mode occur in the same or 
the reverse order as in the dictionary, and that the median is nearer to the 
mean than to the mode (Stuart & Ord 1987). Because the headway pdf is 
positively skewed, these measures occur in the reverse dictionary order. 
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Headway [s] 



Figure 4.10: Median of empirical headway distributions 


According to Stuart & Ord (1987) the following approximate relation 
holds for unimodal probability distributions of moderate asymmetry: 

Md-Mo&3[n(T)-Md]. (4.19) 

The headway distribution is, however, too asymmetric for this relation to 
hold, and the relation gives too high estimates of the median. The linear 
least squares estimates are (fig. 4.11): 

Md L = 0.529 + 0.465t (4.20) 

Md H = 1-601 + 0.107t. (4.21) 

The accuracy of the estimate for high speed roads ( Mdn ) is, however, sus¬ 
pect, because there are only few low volume observations. The coefficient of 
determination is only R 2 = 0.520 for Mdn , as compared to R 2 = 0.825 for 
Md L . 

Separate models were estimated for low and high speed roads, although 
ANCOVA gave significance probability of only P = 0.059 for the homogene¬ 
ity of slopes. The figure, however, shows that the slopes of the regression 
lines are quite different, but most of the points from the high speed roads 
are near the intersection of the lines. In order to stabilize the variance, a 
weighting factor of 1/P was used. The R 2 for the weighted values was 0.966 
for the low speed data and 0.990 for the high speed data. The adjusted 
significance probabilities were P < 0.001 and P = 0.002, respectively. The 
regression is accordingly statistically significant for both data sets. 
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Median 



Figure 4.11: Mean and median of empirical headway distributions 


4.3.3 Coefficient of variation 

The coefficient of variation (c.v.) is the proportion of the standard deviation 
to the mean of a random variable (T): 

*{T) 

p(Ty 

The sample c.v. is the proportion of the sample standard deviation to the 
sample mean: 

C = 1 (4.23) 

The negative exponential distribution (headway distribution of a totally 
random arrival process) has C[T ) equal to unity. Because the negative 
exponential distribution is central in the theory of stochastic processes, and 
it is widely applied in traffic flow theory, C(T) = 1 provides a convenient 
point of comparison. 

The CV-chart in figure 4.12 shows the sample coefficients of variation 
for the headway data. Some basic properties can be observed: 

1. Under heavy traffic conditions, the proportion of freely moving vehicles 
is small. The variance of headways is accordingly small. In figure 4.12, 
the c.v. decreases below unity at high flow levels. 
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Coefficient of variation 



Volume [veh/h] 

Figure 4.12: CV-chart (sample coefficient of variation) of the headway data 

2. The Poisson tendency of low density traffic has a theoretical (Breiman 
1963, Heidemann 1990) as well as an intuitive basis: Under low density 
conditions, the vehicles can move freely, and randomness of the process 
increases. Hence, the c.v. is expected to approach unity as the traffic 
ffow approaches null. This phenomenon can be observed only at low 
speed roads, where more low volume samples are available. 

3. Under medium traffic, there is a mixture of free-ffowing vehicles and 
followers. This increases the variance above pure random process. 
Accordingly, the c.v. rises above unity. This is in contrast to the 
statement of May (1990) that the c.v. approaches unity under low 
flow conditions, but decreases continuously as the flow rate increases. 
According to Pursula & Sainio (1985) the result of May is valid for 
the first lane on a freeway. 

4. The c.v. is larger on high speed roads than on low speed roads. In 
these data, the opposite flow rate was higher on high speed roads, 
thus reducing passing opportunities. Other explaining factors may 
be higher variation of speeds and greater willingness to pass on high 
speed roads. In addition, the intersection density was higher on the 
low speed roads. Consequently, there were more joining and departing 
vehicles, thus reducing the platooning. 

To test whether it is appropriate to model the C(T) on high speed and 
on low speed roads separately, a third degree polynomial curve was fit to the 
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c.v. data. The equality of the means of the residuals for high speed and low 
speed roads were tested using a t-test. Using residuals instead of actual c.v. 
values, increases the power of the test, because the effect of traffic volume 
can be circumvented. The equality of the means should be rejected at risk 
level 2.2 • 1CU 6 . Because there was a significant difference between the c.v. 
values on high speed and low speed roads, a separate model was construed 
for both. 

Two polynomial curves were fitted on the data (figure 4.12): one for 
the high speed (80-100 km/h) roads and the other for the low speed (SO¬ 
TO km/h) roads. The regression equations are: 

C L = -7.667A 2 + 2.247A + 1 (4.24) 

C H = (1 - 1.8A)(1 + 20.173A - 82.189A 2 + 114.839A 3 ). (4.25) 

The coefficient of determination for the mixed model is R 2 = 0.685. That is, 
about 69% of the variance in the c.v. can be explained by the traffic volume. 
A single model for all speed limits has R 2 = 0.226. Modeling high speed 
and low speed roads separately explains about 46% more of the variance. 

There are some points worth consideration in the equations. The curves 
were forced to unity when the traffic volume was zero. This was based on 
the assumption of the Poisson tendency in low density traffic. The upper 
curve was forced to zero at volume 2,000 veh/h in order to prevent extensive 
curviness. 

The curves have maxima at flow rates 530 veh/h (low speed roads) and 
440 veh/h (high speed roads). This corresponds to mean headways 6.8 s and 
8.2 s. Under lower flow rates, the long headways predominate and reduce 
the c.v. Traffic density is low, and there are ample opportunities for passing. 
Under higher flow rates, short headways predominate and reduce the c.v. 
Drivers are more content to follow in platoons (Pursula & Sainio 1985). 
Thus, the maximum is likely to be connected with the passing behavior. In 
Bureau of Public Roads (1950) it was shown that typically the demand for 
passing becomes greater than the opportunities at flow rate 500 veh/h (see 
also Wardrop 1952). 

These observations gain at least partial support from other authors, as 
seen in figure 4.13, although the results are based on very different data. 
The data of Dunne et al. (1968) are from a two-lane rural road, but the 
data of Breiman et al. (1977), Buckley (1968) and May (1965) come from 
freeway lanes. The traffic volumes in all the samples of May are higher than 
in the samples of Dunne et al. (1968). 

The traffic volumes in the samples of Dunne et al. (1968) are between 350 



4-3. Measures of location and dispersion 


71 



Volume [veh/h] 


Figure 4.13: Coefficient of variation in other headway studies: Dunne et al. 

(1968), Buckley (1968), May (1965), Breiman et al. (1977), and Grif¬ 
fiths & Hunt (1991) 


and 700 veh/h; i.e., the volume range that has the highest c.v. values in figure 
4.12. In accordance with these results, all the samples of Dunne et al. have 
c.v. greater than unity. The c.v. is less than unity in the freeway samples of 
May (1965) and Breiman et al. (1977) (lanes one and two), and near unity in 
the samples of Buckley (1968) and Breiman et al. from freeway lane three. 
The data of Buckley represent a wide variety of volume levels, and the c.v. 
is similar to the results for the low speed roads in figure 4.12. The data 
of May were measured for gap availability studies near freeway ramps, and 
results are very different from the others. More general conclusions from the 
data of May should be drawn only with great care. 

Griffiths & Hunt (1991) have studied headways on urban areas. They 
found a linear relation between the standard deviation and the mean head¬ 
way, which can be expressed in terms of the sample c.v. and the traffic 
volume (A, veh/h): 


C 


1.07 + 


0.160A 

3600 


(4.26) 


This line is shown in figure 4.13. 


Finnish studies by Pursula & Sainio (1985) suggest that the c.v. on 
freeways, especially on the first lane, is lower than on two-lane highways 
with speed limit 80-100 km/h. The results of Breiman et al. (1977) also 
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show that lane one has lower c.v. than lane two, which has lower c.v. than 
lane three. 

4.3.4 Skewness and kurtosis 

The proportion of the first two moments was discussed above. The third and 
the fourth central moments, skewness ( 0 : 3 ) and kurtosis ( 04 ), give us more 
information about the shape of the distribution. Skewness is a measure 
of symmetry. For symmetric distributions 03 = 0. If the data are more 
concentrated on the low values, as in a headway distribution, skewness is 
positive. Kurtosis is a measure of how “heavy” the tails of a distribution 
are. Sample skewness and kurtosis are defined as follows: 


, E”=i (ti-ty 

OL 3 = 0 

ns J 

(4.27) 


(4.28) 


where t and s are sample mean and standard deviation, respectively. 

The skewness of the headway data is displayed in figure 4.14. Analysis 
of covariance shows that the skewness of headway distributions on low speed 
and high speed roads differs significantly (P < 0.001). Consequently, the 
linear regression analysis was performed separately for the low speed and the 
high speed data. On low speed roads the linear model explains 36 % (R 2 = 
0.36) of the variance, and the significance of the regression is P < 0.001. 
On high speed roads the respective values are R 2 = 0.073 and P = 0.136, 
indicating that the correlation is not significant, and the skewness is about 
4.05, irrespective of the traffic volume. 

On low speed roads, the skewness is near two at low volumes. This 
corresponds to the skewness of the negative exponential distribution. At 
higher volumes the skewness increases. 

Following Ramberg, Tadikamalla, Dudewicz & Mykytka (1979), figure 
4.15 shows the sample kurtosis against the squared sample skewness. (The 
figure is called here a KS 2 -chart.) This relation is occasionally used as a 
guide in selecting theoretical distributions (Cochran & Cheng 1989). The 
relation in the KS 2 -chart is almost linear, but a slightly better fit (R 2 = 
0.981) was obtained by the power curve: 

a 4 = 2.297(a|) 0 ' 847 . (4.29) 

For comparison, figure 4.15 shows also corresponding points and curves of 
some theoretical distributions. The negative exponential distribution (along 



4-3. Measures of location and dispersion 


73 


Skewness 
10 n 


Speed limit 
50-70 km/h 



o -(—,—,—-—'—i—'—'—'—-—i—'—'—'—'—i 

0 500 1 000 1 500 

Volume [veh/h] 


Figure 4.14: Sample skewness 



Figure 4.15: KS 2 -chart (kurtosis and squared skewness) of the headway data and 
some theoretical distributions 
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with normal and uniform distributions) reduces to a single point. As the 
squared skewness grows the kurtosis grows more slowly than in the case of 
either the gamma or the log-normal distribution, but the gamma distribution 
is more close to the observed values. The strong correlation of the squared 
skewness and the kurtosis motivates a further search for the theoretical 
headway distribution. 

4.4 Tests for the renewal hypothesis 

4.4.1 Background 

According to the renewal hypothesis, the headways are independent and 
identically distributed (i.i.d.): 

H 0 :P{Ti<t | Ti,... ,Ti_! } = P {T t <t} = F(t ), i = 2,3,... (4.30) 


and 


P{Ti <t} = F(t ) . (4.30a) 

Such a stochastic process is called an ordinary renewal process. If the first 
headway (Ti) has a different probability distribution, the process is called 
a modified renewal process. In the headway data under study, the first 
headway in the sample is measured from an arbitrary arrival (synchronized 
sampling). Consequently, the first headway can be assumed to have the 
same distribution as the other headways. 

The alternative hypothesis is that the headways are mutually correlated: 
If a short (long) headway has a higher probability after previous short (long) 
headways the data has positive autocorrelation. If a short (long) headway is 
more probable after long (short) headways the autocorrelation is negative. 
If the headways can be assumed i.i.d., the statistical analysis and modeling 
of vehicle headways is much less complicated. 

At low flow rates, with only small differences in desired speeds and good 
passing opportunities, the vehicles can move freely, and correlation between 
consecutive headways is unlikely. At high flow rates, with large variation 
in desired speeds and restricted passing opportunities, platooning increases. 
Because the speeds of the followers are restricted, the platoons are likely 
to grow. A short headway is then more likely followed by another short 
headway. This makes the hypothesis of positive correlation plausible. On 
the other hand, negative autocorrelation would appear in such conditions 
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which favor platoons of size two (each leader has one follower). This situation 
could realize, when both the demand and the opportunities for passing are 
“reasonable”. Otherwise the hypothesis of negative autocorrelation is not 
justifiable. Consequently, positive autocorrelation should be considered as 
an alternative to the renewal hypothesis. 

If consecutive headways are positively correlated, short and long head¬ 
ways are not randomly distributed, but clustered. This can be tested by the 
runs test. In addition, the positive autocorrelation affects the distribution of 
platoon sizes. Consequently, three methods are presented to test the renewal 
hypothesis for vehicle headway: (1) autocorrelation analysis, (2) runs tests 
for randomness, and (3) tests for the distribution of the platoon length. This 
set of three tests was first proposed by Breiman et al. (1969). The power 
of these tests is now further enhanced by calculating the combined proba¬ 
bilities and the moving probabilities. The combined probabilities of some 
earlier test results are also presented. 


4.4.2 Autocorrelation 

The sample autocorrelation (or serial correlation) coefficient gives a measure 
of correlation between observations at different distances apart. For a sample 
of n observations the autocorrelation coefficient at lag k is: 


ep '(tj-m+k-i) 


(4.31) 


where 



n 



(4.31a) 


Note that po = 1. The sample autocorrelation coefficients are plotted in a 
correlogram (see figure 4.16). 

In a sample of n observations the autocorrelation coefficients are asymp¬ 
totically normally distributed with mean zero and standard deviation 1/n 
(Brockwell & Davis 1987). Thus, under null hypothesis, approximately 95 % 
of the coefficients are within bounds ±1.96n -1 / 2 . The dashed lines in figure 
4.16 show these bounds. 

Autocorrelation coefficients give a rough guide, whether the observations 
are from a renewal process (py = 0, k = 1,2,...). The most important 
coefficient in this respect is p\. Figure 4.17 displays p\ for each headway 
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Autocorrelation coefficient Autocorrelation coefficient 



Figure 4.16: Correlograms with 0.95 confidence limits 


sample. The coefficients have a large variance, but most values are positive. 
This indicates that there may be some small positive autocorrelation, but 
larger samples are needed to detect it. 

Two methods have been used to make the evidence stronger: (1) The sig¬ 
nificance probabilities of p\ for each sample were combined using the moving 
probabilities method. (2) Samples were grouped and partial correlation co¬ 
efficients for autocorrelation were calculated. The methods and their results 
are described below. 

The test for autocorrelation at lag 1 is: 


H 0 : pi = 0 


(4.32) 


against 


H A : p i>0. (4.33) 

This is a one-sided test in contrast to the two sided tests normally used 
(Dunne et al. 1968, Breiman et al. 1969, Chrissikopoulos et al. 1982), because 
it is reasonable to assume that platooning causes positive autocorrelation. 
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Figure 4.17: Autocorrelation coefficients at lag 1 


The test for negative autocorrelation gave clearly nonsignificant values. The 
autocorrelation (pi) is negative in some small samples only. For samples 
larger than 250 observations, only one negative autocorrelation was found 
(sample 11: p\ = —0.017). 

The variation of the significance probabilities over traffic volumes (fig. 
4.18) is described by the moving probability method (see section 2.2.6). On 
high speed (80-100 km/h) roads there is significant autocorrelation between 
consecutive headways. There is, however, a spike in the moving probability 
curve, which may indicate better passing opportunities in some samples. 
(Similar, although lower spikes are in figures 4.19 and 4.20.) On low speed 
(50-70 km/h) roads there is no significant autocorrelation, at least under low 
volumes. The moving probability curve, however, goes down to significant 
values near 900 veh/h. 

The combined significance of all high speed samples is about 3 • 10 —22 , 
and for all low speed samples 0.04. Consequently, the renewal hypothesis 
should be rejected on high speed roads, at least when the traffic volume 
is above 500 veh/h. On low speed roads the possibility of autocorrelation 
should be considered at high volumes (A > 900 veh/h). 

On high speed roads the combined pi is 0.07. Since largest samples give 
more accurate estimates, the sample coefficients were weighted by the sample 
sizes. The weighted estimate is pi = 0.09, and the 95 % confidence interval 
is 0.06-0.12. On low speed roads the coefficient does not differ significantly 
from zero. Accordingly, the tests indicate a small, but statistically significant 
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Significance 



Figure 4.18: Significance of tests for renewal hypothesis against positive autocor¬ 
relation (pi > 0), and 9-point moving probabilities 


autocorrelation at lag one on high speed roads. 

The autocorrelation was also tested using the partial correlation coeffi¬ 
cients. The samples of low speed and high speed roads were combined into 
three classes: (0, 500], (500, 1000], and (1000, 1500] veh/h. Because each 
sample has a different mean headway (t ), there is correlation between head¬ 
ways and their sample means r t y = corr(f,, t ). The correlation of headways 
and sample means increases the autocorrelation \p± = corr(t*, fj_i)] in a com¬ 
bined sample. The effect of the mean headway can, however, be eliminated 
by the partial correlation coefficient (Fisher 1938): 

Pit = corr( ti, U -1 | t) = — - t AJizlL [(1 _ r 2 _)(! _ r t 2 ,_ it -)] 5 . (4.34) 


The partial correlation coefficient for autocorrelation at lag one gives the au¬ 
tocorrelation coefficient, when the mean headway is held constant. Because 
r ti i and r t ._ x i can be assumed equal (rffi), the partial correlation coefficient 
can be calculated as: 


Pit = 


Pi ~ 
1 - 



(4.35) 


( 4 . 36 ) 


The null hypothesis 


H o ■ Pit = 0 
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can be tested using the following test statistic (Fisher 1938): 

y = ^ ^4, (4.37) 

v 1 " Pit 

where n is the number of observations in the class. The statistic y follows 
f-distribution with n — 4 degrees of freedom. 3 The test results are presented 
in table 4.1. 


Table 4.1: Partial correlation coefficients (pi-t) 


Volume class 

n 

Pit 

Significance 

50-70 km/h 

0- 500 

veh/h 

2494 

0.017 

0.198 

500-1000 

veh/h 

2832 

0.025 

0.092 

1000-1500 

veh/h 

500 

0.053 

0.120 

0-1500 

veh/h 

5826 

0.018 

0.081 

80-100 km/h 

0- 500 

veh/h 

580 

0.064 

0.062 

500-1000 

veh/h 

2361 

0.093 

3.30 • 10~ 6 

1000-1500 

veh/h 

7650 

0.092 

3.94 • 10" 16 

0-1500 

veh/h 

10591 

0.076 

3.24 • 10 -15 


The partial correlation coefficients do not indicate any significant auto¬ 
correlation at lag one on low speed roads. On high speed roads the positive 
autocorrelation is extremely significant under volumes 500-1,500 veh/h. At 
lower volumes the autocorrelation is not significant, but the calculation is 
based on 580 observations only. 

The results of the moving probability method and the partial correlation 
coefficients are similar. To sum up, the tests indicate that on high speed 
roads at volumes greater than 500 veh/h the autocorrelation coefficient at 
lag 1 is about 0.09. Under lower volumes and on low speed roads, there is no 
evidence of autocorrelation. It should be emphasized that the decisive factor 
may not be the speed limit, at least alone. Other road conditions, such as 
the intersection density and the opposite flow rate may have an impact on 
the autocorrelation, as well. 

Some earlier results are discussed below. Combined probabilities and 
one-sided test results have been calculated by the present author on the 

3 The degrees of freedom are k — 3, where k is the number of correlation pairs {ti,ti- 1 }, 
which is equal to n — 1. This is also true for the rightmost rooted term in the equation 
for y. 
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basis of the original results. Dunne et al. (1968) studied the autocorrelation 
of trend-free samples from a two-lane road. The combined probability of 
their nine data sets (lag 1) was 0.702 (one-sided test), which is consistent 
with the renewal hypothesis. Two-sided test gives 0.284, which is also non¬ 
significant. Breiman et al. (1969) found in one of eight data sets (three-lane 
unidirectional section of John Lodge Expressway in Detroit) significant au¬ 
tocorrelation (lag 1) at 0.05 level. The hypothesis of independent intervals 
was not rejected. The combined significance is 0.23. Testing for positive 
autocorrelation only (one-sided test) the combined significance is 0.05, sug¬ 
gesting possible positive autocorrelation. In a later article Breiman et al. 
(1977) proposed that a small positive correlation between successive space 
headways may be present. Cowan (1975) studied 1,324 successive head¬ 
ways on a two-lane road in Australia. On the basis of the autocorrelation 
coefficient and a runs test, the renewal hypothesis was not rejected. Chris- 
sikopoulos et al. (1982) studied six headway samples from U.K. trunk roads. 
The result of their unspecified tests was that the headways were indepen¬ 
dently distributed. However, the combined significance probability (lag 1) 
of their data is 0.012 using 2-sided test, and 0.0015, when testing against 
positive autocorrelation. 

Previous studies have so far given support to the renewal hypothesis. 
Further analysis of this material has cast some doubt on the conclusions. 
Also the new data presented, shows that the possibility of a small positive 
autocorrelation between consecutive headways should be taken seriously. 
Although the autocorrelation is statistically significant, it is virtually in¬ 
significant in many applications. 


4.4.3 Randomness 

Randomness of the headway data was tested using the Wald & Wolfowitz 
(1940) runs test. It tests, whether long and short headways are randomly 
distributed, or if short headways are clustered. Testing runs above and 
below the median ( Md ) (Madansky 1988) is appropriate in this case (see 
also Breiman et al. 1969). 

Let each observation L be associated with a variable Xi so that: 


Xi = 



if ti > Md 
if ti < Md. 


(4.38) 


The observations equal to the median are ignored. A sequence {xj + 1 , 
xj. |_2 ,- • •, Xj + k} is called a run if xj + \ = Xj + 2 = ■ ■ ■ = Xj + /~ and Xj / Xj+i 



4-4- Tests for the renewal hypothesis 


81 


Significance 



Volume [veh/h] 

Figure 4.19: Significance probabilities of one-sided runs tests and 7-point moving 
probabilities 


and Xj + k 7 ^ Xj+fc+i. The first and the last observation begin and end, re¬ 
spectively, a run. 

The number of runs is asymptotically normally distributed with mean 
and variance equal to (Wald & Wolfowitz 1940): 


2 r(n — r) 

fL =-b 1 

n 

2 2 r(n — r ) [ 2 r(n — r) 


— n 


n 2 (n — 1 ) 


(4.39) 

(4.40) 


where n is the total number of observations and r is the number of observa¬ 
tions below the median. Clustering reduces the number of runs. The number 
of runs is reduced also by a trend in the data. Hence, the trendlessness of 
the data is important. 

The one-sided tests for the probability of less runs were used, in order 
to test if the platooning reduces the number of runs. Figure 4.19 shows 
the significance of the one-sided tests and the moving probabilities (section 
2.2.6) for high speed and low speed roads. The combined probabilities are 
displayed in table 4.2. The combined probabilities for two-sided tests are 
presented for comparison. 

On high speed roads the moving probabilities are significant almost 
throughout the volume range. On low speed roads the nonrandomness is 
significant for volumes greater than 700 veh/h. The combined significance 
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Table 4.2: Significance probabilities of the runs tests 


Speed limit [km/h] 

One-sided test 

Two-sided test 

50-70 

0.001 

0.074 

80-100 

0.000 

0.000 


on high speed roads is 1.6 • 10” 20 and on low speed roads 0.001. It is evident 
that the headways are not i.i.d., but clustered. 

Breiman et al. (1969) found only one significant value (0.03) in eight 
runs tests. The combined probability (0.49) is also nonsignificant. This is 
contrary to the results presented above. 


4.4.4 Platoon length 


Vehicle (i) is defined to be a follower (0) if its headway is at most to> oth¬ 
erwise it is a leader (1). The status of a vehicle is accordingly defined as 
follows: 


Xi = 



if ti < to 
otherwise. 


(4.41) 


The difference in speed is ignored. 

The number of vehicles in a platoon is the number of consecutive gaps 
ti < to (followers) plus 1 (leader). The platoon has length k if x\ = 1, 
X 2 = 0, X 3 = 0,..., Xk = 0, and Xk+i = 1. The platoon may consist of a 
single vehicle {k = 1). 

If the headways are i.i.d., the probability of vehicle i being a follower is: 


P = P{h < t 0 } = F(t 0 ). (4.42) 

The platoon length is then geometrically distributed (Drew 1968), and the 
probability for length k is: 

Pk =p k ~ 1 (l -p), k= 1,2,... (4.43) 


The renewal property (i.i.d. headways) can be tested using the null hy¬ 
pothesis: 

H 0 ■ Pk = p k ~ 1 {l- p ) ( 4 . 44 ) 
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Significance 



Figure 4.20: Chi-square tests for geometrically distributed platoon lengths and 
9-point moving probabilities 

against the alternative hypothesis: 

Ha- Pk -p). (4.45) 

The tests are based on platoon criterion to = 5 s, and the chi-square 
goodness-of-fit test is used. The chi-square test was performed with k — 2 
degrees of freedom. One degree of freedom was lost, because p was esti¬ 
mated from the sample. Platoon lengths 1, 2,..., 20 and > 20 were defined 
as distinct classes. They were combined so that the expectation for each 
class was not less than five, except for the last class the expectation was 
not less than unity. On high speed roads one sample was excluded from the 
analysis, because, after combining classes, there were no degrees of freedom 
left. The results of the tests are displayed in figure 4.20. 

The significance probabilities of the low speed samples are rather evenly 
spread, and the moving probability stays above significant values. The sig¬ 
nificance probabilities of the high speed samples are more concentrated on 
the low values. The combined probability is 0.13 for low speed samples and 
1.9 • 10 -9 for the high speed samples. Consequently, the hypothesis of geo¬ 
metric platoon length distribution should be rejected on high speed roads, 
at least for volumes above 500 veh/h. 

In addition to the geometric distribution, several other distributions have 
been proposed as models for the platoon length distribution, such as the 
Borel-Tanner distribution (Tanner 1953, Tanner 1961) and the Miller distri¬ 
bution (Miller 1961) with one and two parameters. Miller (1961) found that 
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his one parameter model fitted the data about as well as the Borel-Tanner 
distribution. Galin (1980) found the Borel-Tanner and the two-parameter 
Miller models more accurate than the geometric distribution or the one- 
parameter Miller model. The headway data were from two-lane Israeli roads. 
The combined significance probability for the geometric distribution can be 
calculated as 0.0001. For comparison, the respective probability for the 
Borel-Tanner distribution is 0.739, for the one parameter Miller distribution 
0.0007, and for the two parameter Miller distribution 0.889. These results 
confirm the rejection of the geometric distribution. 

4.5 Discussion 

After the measurements, trend tests and other preliminary analysis, 63 
trendless samples consisting of 16,417 headways were analyzed. The samples 
were classified into two road categories according to the speed limit: the low 
speed roads (50-70 km/h) and the high speed roads (80-100 km/h). 

The renewal hypothesis was tested using three methods: autocorrelation 
analysis, runs tests and tests for geometrically distributed platoon lengths. 
The significance probabilities of individual samples were combined, and the 
variation of significance over different flow levels was described by moving 
probabilities. In contrary to many previous papers, the headway data indi¬ 
cates that the possibility of a small positive autocorrelation should not be 
neglected. 

A four-stage identification process (Luttinen 1994) is suggested to de¬ 
scribe the properties of the headway distribution. This process combines 
the most powerful analyses presented above: 

1. The empirical probability density function (edf) shows the basic shape 
of the headway distribution more clearly than the empirical distribu¬ 
tion function. It should be estimated by a kernel method in order to 
obtain a continuous estimate, and to avoid the bias due to the his¬ 
togram method. 

2. The empirical hazard function (ehf) gives even more information about 
the statistical properties of headways than the edf. A graphical exam¬ 
ination of both the edf and the ehf gives a clear indication about the 
shape of the headway distribution. 

3. The moments of a distribution provide some numerical measures for 
the identification process. The sample moments may have large varia¬ 
tion, but their relations provide proper measures for the identification. 
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The coefficient of variation is a scaled measure of the distribution 
spread. A CV-chart gives essential information about the headway 
data and proposed theoretical distributions. It is especially worth 
checking, whether the coefficient of variation is near, below, or above 
unity. 

4. The kurtosis and the squared skewness of the headway samples have 
a close, linear or quasi-linear relation, which supports their applica¬ 
tion in the identification. The KS 2 -chart shows very effectively the 
differences between theoretical distributions and the data. 

The four-stage identification process reveals many shortcomings of the 
simple distributions. The exponential distribution does not pass any of the 
identification phases. The hazard function and the coefficient of variation 
reveal the problems of the gamma distribution most clearly. The deviation 
of the lognormal distribution from the headway data is most obvious, when 
the kurtosis and the squared skewness are measured. The empirical hazard 
function indicates that the tail of the headway distribution is exponential. 
Consequently, the four-stage identification process suggests mixed distribu¬ 
tions with an exponential right tail. The process can be applied to further 
evaluate such models. 

The identification process has demonstrated some general properties of 
headways. There is, however, wide variability among the samples and the 
road categories. High speed roads have a higher peak in the empirical den¬ 
sity and hazard functions, a higher proportion of trailing vehicles, and a 
higher coefficient of variation, as well as more significant serial correlation 
between consecutive headways. These differences all relate to the higher pla- 
tooning on high speed roads, which can be explained by many factors: The 
number of passing opportunities depends on the headway distribution of the 
opposite flow. If the passing rate is different from the catch-up rate, the flow 
is not in equilibrium but in a transition state (McLean 1989). The shape of 
the headway distribution changes, as the measurement point is moved along 
the road. When the passing demand exceeds the passing opportunities, it 
can be expected that, moving in the direction of the traffic stream on a lane, 
the speeds will decrease and the platooning increase (Botrna 1986). On a 
non-uniform road, the equilibrium will not be achieved if the sub-section 
is shorter than the required transition length (McLean 1989). In addition, 
speed limit, passing sight distance, lane and shoulder width, climbing lanes, 
flow rate of the opposing traffic, 4 intersection density, and distance to major 

4 When the opposing volume is above 400-450 veh/h, the headway distribution does 
not change relevantly (Heidemann 1993). 
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traffic generators will have an effect on the headway distributions. Conse¬ 
quently, the traffic flow is not stationary in space, but each cross-section on 
each road has its own traffic characteristics. A headway model is necessarily 
a gross simplification of the reality. 

The data from high speed roads have been analyzed previously by Pur- 
sula & Sainio (1985) and Pursula & Enberg (1991). Some comparisons of 
the results are presented below: 

1. The density estimates were not calculated by Pursula & Sainio (1985), 
but frequency diagrams were displayed with bin width 0.5 s and the 
origin at zero. Because the frequencies were not corrected by the 
interval width, the values were lower by a factor of 0.5, as compared 
to the density estimates. The basic shape was, however, the same as 
presented here. As Pursula & Sainio (1985) observed, the mode of the 
headway distributions is between 1 and 2 seconds. 

2. The peak heights are in the same range, when the correction due to 
the interval width is made. The comparison with the exponential 
distribution is different, because the correction is not made in figure 
23 of Pursula & Sainio (1985). If the correction is made, the results 
are very similar. 

3. The coefficients of variation have similar values. The c.v. is above 
unity at volumes less than about 1,500 veh/h. In the data of Pursula & 
Sainio (1985) the c.v. decreases steeply at volumes above 1,200 veh/h. 
In the present data, the decrease begins gradually and at lower vol¬ 
umes. 

4. The platoon percentage is similar here as presented by Pursula & 
Sainio (1985) and Pursula & Enberg (1991). The curve in figure 4.2 
is, however, in the upper part of the range in figure 24 of Pursula & 
Sainio (1985), especially at low volumes. 

A major reason for the differences may be in the data collection prin¬ 
ciple. Trend analysis was not used, but the headways were collected 
in 5 minute time slices. Moreover, the proportion of heavy vehicles 
and passing vehicles did not have any effect on the data collection. In 
the present analysis, the rejection of measure periods with more than 
1 % passing vehicles may have favored samples with more vehicles in 
platoons and no “holes” in the flow. There may also be differences in 
the opposite traffic volumes. 



Chapter 5 

Theoretical headway models 


5.1 Principles of evaluation 

The evaluation of the headway distribution models is based on three con¬ 
siderations: 

1. Reasonability. It is an advantage if the structure of the model 
is based on explicit theoretical reasoning about the characteristics of 
traffic ffow. The parameters of such models can give additional infor¬ 
mation on the properties of traffic flow. 

2. Applicability. In mathematical analysis the model should have a 
simple structure to avoid insurpassable problems, and the existence 
and simple form of the Laplace transform 1 is often an advantage. If 
simulation is considered as an alternative, the generation of pseudo¬ 
random variates should be fast and reliable. Parameter estimation 
should not be too complicated. 

3. Validity. The model should give a good approximation of the real 
world phenomena; i.e., the empirical headway distributions. This is 
tested first by the identification process and finally by the goodness- 
of-fit tests. 

1 Laplace transforms, or some other transforms, are commonly used in probability the¬ 
ory, because they make many calculations easier. The Laplace transforms can be used to 
obtain the moments of a distribution (see footnote 5, page 129). When dealing with sums 
of random variates, the convolution property of the Laplace transform is most helpful: 

fx+y(s) = fx(.s)fy(s). 

(See Kleinrock 1975, Luttinen 1990). 
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Five theoretical distributions were selected for a more detailed analysis: 
the negative exponential distribution, the shifted exponential distribution, 
the gamma distribution, the lognormal distribution and the semi-Poisson 
distribution. The negative exponential distribution is the interarrival time 
distribution of a totally random arrival process, i.e. the Poisson process. 
The shifted exponential distribution avoids the problem of extremely short 
headways by setting a threshold for short headways. The gamma distribu¬ 
tion is a generalization of the exponential distributions. A special case of 
the gamma distribution, namely the Erlangian distribution, has been widely 
used in the traffic ffow theory. The lognormal distribution has a theoretical 
connection to the car-following models (Greenberg 1966). As an example 
of mixed models, the semi-Poisson distribution is studied. It has combined 
the follower headway distribution and the exponential tail most elegantly. 
Some other mixed models are discussed briefly. 

The theoretical models are divided into two classes: the simple distribu¬ 
tions and the mixed distributions. The simple distributions have the same 
model for all vehicles. The mixed distributions typically have two distinct 
headway distributions: one for free-moving and the other for trailing (fol¬ 
lowing) vehicles. 


5.2 Simple Distributions 

5.2.1 Negative exponential distribution 

Properties of the negative exponential distribution 

The negative exponential distribution is the interarrival time distribution 
of the Poisson process. The Markov property 2 makes the distribution ana¬ 
lytically simple to use. Consequently, the distribution is widely used in the 
theory of point processes. In traffic flow theory, the negative exponential 
distribution has been used since Adams (1936). 

A process, which has the following four properties, is a Poisson process 
(Khintchine 1960): 

1. The number of events at the beginning of the process is zero; i.e., 
N{ 0) = 0. 

2 Also called “lack of memory”: 

V{T>t + u\T>t} = ¥{T > m} . 
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2. The process has independent increments: 

r 

P{IV(Zi,ui] = m,..., N(l r ,u r \ = n r } = P {N(li,Ui\ = n, 

i—1 

for all l± < ui < I 2 < U 2 <■■■< l r < u r , r = 1,2, 

3. The process has stationary increments: 

P{JV(a,6] = k} = P{lV(a + c,b + c] = k} 

= P{1V(6 — a) = k} 0 < a < b, c> 0 

4. At any time instant only one event can occur: 

P {N(t, t + A] > 2} = o(A) as A —>• 0. 

The basic statistical properties of the negative exponential distribution 
are presented in table 5.1. Concerning the four-stage identification process, 
the following observations can be made: 

1. The negative exponential density function (fig. 5.1) has mode at zero. 
The peak height is /(0) = A, after which the density decreases expo¬ 
nentially. The distribution overestimates the frequency of extremely 
short intervals and underestimates the frequency of headways near 
the mode of the empirical headway distribution. Longer headways are 
again overestimated (fig. 5.2). 

2. The hazard rate is equal to the flow rate [h(t) = A]. 

3. The coefficient of variation is equal to unity. 

4. The skewness and the kurtosis have constant values, independent of 
the scale parameter (A). 

The negative exponential distribution does not pass the four-stage identifi¬ 
cation process. In fact, it fails in all four stages. 

Parameter estimation 

The expectation of the distribution is the reciprocal of the scale parameter 
(Luttinen 1990). Accordingly, the method of moments estimator (MME) for 
the scale parameter (A) is the reciprocal of the sample mean: 



(5.1) 

(5.2) 

(5.3) 


(5.4) 
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Volume [veh/h] 



Figure 5.1: Negative exponential density function 




Headway [s] 


Headway [s] 


Figure 5.2: Negative exponential pdf’s and sample edf’s 
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Table 5.1: Properties of the negative exponential distribution 


PDF 

F(t) = 1 1 _ e_At ’ itii0 

1 0, otherwise 

pdf 

m = l Xe ~“ if,2 ° 

1 0, otherwise 

Hazard function 

= in -° 

1 0, otherwise 

Laplace transform 


Mean 

,(D = 1 

Median 

Md = — In 2 

A 

Mode 

Mo = 0 

Variance 

" 2 < r > - u 

Coefficient of variation 

C(T) = 1 

Skewness 

a 3 (T) = 2 

Kurtosis 

ot±(T) = 9 


A > 0 


The likelihood function and the loglikelihood function are: 


L{ X) = A n e" AE ?=i^ 

(5.5) 

71 

In L{X) = n In A — A tj. 

(5.6) 




The maximum likelihood estimator (MLE) is obtained by differentiation 

dlnL(X) n 


d\ X 

t. 


5>=° 


(5.7) 


1=1 


(5.7a) 
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P(K-S) P(A-D) 



Figure 5.3: The nonparametric K-S and the parametric A-D goodness-of-fit tests 
for the negative exponential distribution 


The MLE and the MME are identical for the negative exponential distribu¬ 
tion. This estimator has been used in the following statistical analyses. 


Goodness-of-fit tests 

The goodness of fit was tested using the parametric Anderson-Darling (A-D) 
test. For each sample 9,999 replications were generated and analyzed. Figure 
5.3 shows the test results. For comparison, the results of nonparametric 
Kolmogorov-Smirnov (K-S) tests are also presented. 

The combined significance probability of the A-D tests is practically zero 
(2.9 • llT 150 ). Even for the three low volume (less than 160 veh/h) samples, 
the combined significance is 0.00016. Only two small samples (45 and 46) of 
no more than 100 observations yield nonsignificant results. Consequently, 
the negative exponential distribution should be rejected, at least for volumes 
greater than 140 veh/h. 
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Negative exponential distribution in headway studies 

Reasonability The Poisson process describes situations, where vehicles 
without physical length can move freely. Several authors (e.g., Weiss & 
Herman 1962, Breiman 1963, Thedeen 1964) have shown that under rather 
weak assumptions (no driver interactions and i.i.d. speeds) the number of 
vehicles in an arbitrary interval will be asymptotically Poisson distributed 
as time tends to infinity. Thus, the headways will asymptotically follow the 
negative exponential distribution. This is called the Poisson tendency of 
low density traffic. 

Some platooning nearly always occurs on two lane roads, because passing 
is restricted, and even in good passing conditions some drivers are content 
to follow behind. The Poisson process cannot deal with this phenomenon. 

Applicability Because of the Markov property, the negative exponential 
distribution lends itself to demanding mathematical analyses. A further ad¬ 
vantage is the simple form of the Laplace transform. Therefore, the negative 
exponential distribution is often preferred over other, more realistic but also 
more complex distributions. In many cases explicit solutions can only be 
found, when the arrival process is assumed Poisson. 

Because the traffic volume is the only information needed, the parame¬ 
ter estimation is very straightforward. Also, the generation of exponential 
random variates is very effective. Hence, the applicability of the negative 
exponential distribution is excellent. 

Validity The frequency of unrealistically short headways in the negative 
exponential distribution is too large. In fact, extremely short headways have 
the highest probability density. The model gets more distorted as the flow 
rate increases (fig. 5.2). 

On the basis of the Poisson tendency theory, the exponential distribution 
could be used in the study of low density traffic conditions. Wattleworth (in 
Baerwald 1976) suggests flow rates of 500 veh/h or less. The coefficient of 
variation data and the goodness-of-fit tests above suggest even lower volumes 
(about 100 veh/h or less). There is, however, not enough low volume data 
to make accurate recommendations. 

The negative exponential distribution has since Adams (1936) played a 
major role in the theoretical study of traffic flow, and especially in traffic 
signal control. The studies of Garwood (1940), Darroch et al. (1964), and 
many others rely on the exponential headway distribution. Also in some 
modern studies on adaptive traffic signal control, the queue length predictor 
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algorithm is based, at least partially, on Poisson arrivals (Baras, Levine & 
Lin 19796, Betro, Schoen & Speranza 1987). 

In conclusion, the negative exponential distribution can be considered as 
a model for vehicle headways under very low flow conditions and for appli¬ 
cations that are not very sensitive to the shape of the headway distribution. 
In some cases the analytical solution of the problem may only be possible 
with a very simple model. In such situations the limitations of the model 
should be clearly indicated. 


5.2.2 Shifted exponential distribution 

Properties of the shifted exponential distribution 

One possibility to avoid the extremely short headways predicted by the 
negative exponential distribution is, in queuing theory terminology, to add a 
phase with deterministic service time (r) in series with the exponential server 
(Luttinen 1990). This is equivalent to shifting the negative exponential 
distribution r seconds to the right. A headway is then considered as a 
random variate T = r + X, where r is a nonnegative constant, and A is a 
random variate following the negative exponential distribution. 

The density function is obtained from the negative exponential pdf by 
replacing t with t — r: 


f(t\r,9 


0 e -0(t-r), if t>T 
0 , otherwise, 


(5.8) 


where r and 8 are the location and scale parameters, respectively. 

The basic statistical properties of the shifted exponential distribution 
are presented in table 5.2. Concerning the four-stage identification process, 
the following observations can be made: 

1. The pdf (fig. 5.4) has mode at t with peak height /(r) = 9, after 
which the density decreases exponentially. Figure 5.5 shows the shifted 
exponential pdf estimates for four headway samples. The pdf under¬ 
estimates the density near the sample mode, and overestimates the 
frequency of long headways. 

2. The hazard rate is equal to the scale parameter; i.e., h(t) = 9 for t > r. 

3. In terms of flow rate [A = 1 //z(T)], the coefficient of variation is 
C(T) = 1 — At. Because the location parameter is always nonneg¬ 
ative (t > 0 ), the c.v. is less than or equal to unity. 
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Table 5.2: Properties of the shifted exponential distribution 


PDF 

{ 1 - e -0(*- T ) if t> T 

F(t) = { ’ - 

10, otherwise 

pdf 

,, , [ 9e~ e ^~ T \ if Or 

f(t) = < 

1 0, otherwise 

Hazard rate 

M0 = | 9, itf - T . 

1 0, otherwise 

Laplace transform 

f*( s ) = e ~ ST ~r~n 
s 4- 6 

Mean 

n(T) = T+ i 

Median 

In 2 

Md = t H —— 

U 

Mode 

Mo = T 

Variance 

= p 

Coefficient of variation 


Skewness 

03 (T) = 2 

Kurtosis 

«4 (T) = 9 


6 > 0,r > 0 


4. The skewness and the kurtosis have constant values, independent of 
the parameters. 

The shifted exponential distribution does not pass the identification process. 
Like the negative exponential distribution, it fails in all the four stages. 

Parameter estimation 

Method of moments estimators The method of moments estimators 
(MMEs) are obtained by equating the expressions for the mean and the 
variance (table 5.2) to the sample mean (t) and the sample variance (s 2 ): 

1 

t = T + - 
6 


(5.9) 
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Volume [veh/h] 



Figure 5.4: Shifted exponential density function 
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Figure 5.5: Shifted exponential pdf’s and sample edf’s 
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By solving this pair of equations the MMEs are obtained as: 


(5.10) 


f = t — s (5-11) 

e= 1 =\. (5.12) 

S t — T 

If the sample c.v. is greater than unity, the sample standard deviation 
(s) is greater than the sample mean (t). The location parameter estimate is 
then infeasible (f < 0), and the best estimate, in the sense of the method of 
moments, is r = 0, which leads to the negative exponential distribution. 


Modified method of moments estimators The first order statistic 
(T (1) ) in a sample of size n from a shifted exponential distribution /(t | r, 9 ) 
follows the shifted exponential distribution with the scale parameter equal 
to nO (Cohen & Whitten 1988). Thus: 


/( t( l) I r, nd ) 


n9e- ne{t W- T) , 
0 , 


if f (1) > r 
otherwise. 


(5.13) 


T(!) has expectation: 

E(T m ) = T *^- (5 ' 14) 

The estimating equations of the modified method of moments estimators 
(MMMEs) are then: 


E(T) = i=f+l 
E{T (1) ) = t (1) = t + 

The MMMEs are now obtained as follows: 

nt(i) — t 
n- 1 

6 = 

t — T 


(5.15) 

(5.16) 


(5.17) 

(5.18) 


According to Cohen & Whitten (1988) these are both the best linear un¬ 
biased estimators (BLUE) and the minimum variance unbiased estimators 
(MVUE), and therefore they are the preferred estimators. 
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Location parameter 
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Figure 5.6: MMMEs for the location parameter of the shifted exponential distri¬ 
bution 


Figure 5.6 shows the MMMEs for f. The estimates do not correlate with 
the volume. The mean of the estimates is 0.63 s on low speed roads and 
0.38 s on high speed roads. The difference in the estimates is statistically 
significant (P < 0.0005). The two estimates (samples 45 and 46) greater 
than one second must be considered outliers due to small sample sizes (n < 
100). If they are removed, the mean for the low speed roads is 0.59 s, which 
is still significantly larger than the mean for high speed roads. 


Maximum likelihood estimators The likelihood function and its loga¬ 
rithm for the shifted exponential distribution are: 


L(t,9) 
In L(r, 9) 


0n e ~J2]= 

0 , 


if f (1) > T 
otherwise 


n In 9 n - J2j=i - p) , if t(i)>r 
—oo, otherwise. 


(5.19) 

(5.20) 


For any fixed 9, the likelihood function is maximized by choosing r as 
large as possible so that: 


/(ti | r, 9 ) >0 for all* = 1,..., n. (5-21) 


This leads to: 


t = nrin{ti, ...,t n } = t (1) . 


(5.22) 
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If the location parameter estimate t is known, the scale parameter estimate 
(9) is obtained by solving (Dudewicz & Mishra 1988): 


dL(f, 9) 
89 


n 


“ f ) = °> 


i =1 


(5.23) 


which gives: 


9 


1 

t — T 


(5.24) 


Because the probability P {T < t (1) } = ^t(i) |0,f 
bad goodness-of-fit test results. 


is null, the MLEs give 


Modified maximum likelihood estimators The modified maximum 
likelihood estimate (MMLE) for r is calculated from the following equation: 


F {t {1) \f,9) = (5.25) 

The estimator for 9 is the same as given by the maximum likelihood method, 
as well as by the other methods. The MMLEs are accordingly: 


1 

t — T 


(5.26) 

(5.27) 


For sample sizes near and above 100, the MMLEs are virtually equal to the 
MMMEs. The MMMEs were used, because they could be calculated more 
effectively. 


Goodness-of-fit tests 

The goodness of fit was tested using the parametric Anderson-Darling (A-D) 
test. For each sample 9,999 replications were generated and analyzed. Fig¬ 
ure 5.7 shows the test results. For comparison, the results of nonparametric 
Kolmogorov-Smirnov (K-S) tests are also presented. The combined proba¬ 
bility of the A-D tests is 5.3 • 10~ 150 . Even for the three low volume (less 
than 160 veh/h) samples, the combined significance is 0.002. The shifted 
exponential distribution does not score much better than the negative expo¬ 
nential distribution, and it should be rejected, at least for volumes greater 
than 140 veh/h. 
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P(K-S) P(A-D) 



Volume [veh/h] Volume [veh/h] 


Figure 5.7: The nonparametric K-S and the parametric A-D goodness-of-fit tests 
for the shifted exponential distribution 


Shifted exponential distribution in headway studies 

Reasonability The location parameter (r) prevents the occurrence of ex¬ 
tremely short headways, which is a major problem of the negative exponen¬ 
tial distribution. A fixed threshold for the minimum headway is a simplistic 
solution that is not in accordance with the car-following dynamics. The 
shifted exponential distribution is not a reasonable model for headways, but 
its properties are best under low flow rates. 

Applicability If the exponential distribution has a positive location pa¬ 
rameter, it loses the Markov property. The applicability, however, remains 
good. The Laplace transform is not complicated, the parameter estimation 
is straightforward, and pseudo-random variates can be generated efficiently. 
The shifted exponential distribution has been widely used in simulation 
studies (Lin 19856, Lin 1985 a, Shawaly, Ashworth & Laurence 1988). 

Validity The goodness of fit is only a little better than for the negative 
exponential distribution. The shifted exponential distribution can be con- 
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sidered for very low flow conditions, or for applications that are sensitive 
to extremely short headways but not to other properties of the headway 
distribution. 

In some applications, it may be better to choose a location parameter 
larger than t(iy Statistically such headway distributions are infeasible, but 
they may give better results, because the mode moves closer to the sample 
mode. The validity of the application should be judged by verifying the 
accuracy of the results. 

5.2.3 Gamma distribution 
Properties of the gamma distribution 

The shifted exponential distribution avoids the problem of extremely short 
headways by adding a constant (r) to an exponential random variate. In 
queuing theory terminology this means adding a server with constant service 
time in series with a server having negative exponential service time distri¬ 
bution (Luttinen 1990). Another approach is to put a exponential servers in 
series, thus reducing the probability of extremely short departure intervals 
(Luttinen 1990). In both cases, the service is in stages, so that only one 
server can be occupied at a time. 

The resulting departure interval is a sum of a mutually i.i.d. exponential 
random variates: 

S a = X] + X 2 + • • • + X a , o £ {1,2,...}. (5.28) 

The new random variate ( S a ) is said to follow the a-phase Erlangian distri¬ 
bution. 

The probability distribution function of the Erlangian distribution is: 

F(f| / 3,a) = l-f;»e-'* (5.29) 

3=0 3- 

The probability density function (fig. 5.8) follows by differentiation: 

f(t\P,a)= ^ Pe~ ,3t . (5.30) 

(a - 1)! 

The negative exponential distribution is a special case (a = 1) of the Erlan¬ 
gian distribution. 

If the number of phases in the Erlangian distribution is allowed to be non¬ 
integer (a > 0), the resulting generalization is called the gamma distribution. 
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Figure 5.8: 3-phase Erlangian density function 


The proportion of short headways can be further adjusted by introducing a 
location parameter (r). The pdf is then: 



(5.31) 


where 



(5.31a) 


is the gamma function. It is a generalization of the factorial function: 


(x — 1)! = T(x). 


(5.32) 


The parameters of the distribution are called the location parameter (r), 
the scale parameter (/3) and the shape parameter (cc). 

The basic statistical properties of the gamma distribution are presented 
in table 5.3. Concerning the four-stage identification process, the following 
observations can be made: 

1. If the shape parameter (a) is equal to unity, pdf (fig. 5.9) has mode at 
r with peak height /(0) = (3 (i.e., the shifted exponential distribution). 
As a increases above unity, the mode approaches the mean. If a is 
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less than unity, the mode is at r, and the peak height rises to infinity: 
lim t^ T +f(t) = oo. Figure 5.10 shows gamma pdf estimates for four 
samples. 

2. The hazard rate is h(t) = /3 for a = 1, monotone decreasing from 
infinity for a < 1, and monotone increasing from zero for a > 1 (see 
figures 5.11 and 5.12). If a / 1, the hazard rate approaches f3 as t 
increases. For t < r the hazard rate is h(t) = 0. 

3. If r = 0, the coefficient of variation is equal to unity for a = 1, less 
than unity for a > 1 and greater than unity for a < 1. As r increases, 
the coefficient of variation decreases. 

4. The skewness [« 3 (T)] and the kurtosis [cc 4 (T)] are functions of the 
shape parameter (cc) only. Both are positive and decrease, as the 
shape parameter increases. The kurtosis and the squared skewness 
have the following relation: 

a 4:(T) = 3+ — a\(T). (5.33) 

For a corresponding value of squared skewness, the kurtosis is a little 
larger than the headway kurtosis (fig. 4.15, page 73). 

The gamma distribution does not pass the four-stage identification pro¬ 
cess. The hazard function is most informative in this respect. Also, for 
(cc > 1) the coefficient of variation decreases below unity and, as a increases, 
the distribution soon becomes too smooth and too symmetric. 


Parameter estimation 

Method of moments estimators The estimating equations for the 
MMEs are: 


t = f + 


a 



(5.34) 

(5.35) 


(5.36) 



104 


CHAPTER 5. THEORETICAL HEADWAY MODELS 


Table 5.3: Properties of the gamma distribution 


PDF 

('![a,(i(t-T)\ 

F(f)= r (o ) ■ ,ttiT 

[0, otherwise 

pdf 

m = \ T(a) ’ - 

[0, otherwise 

Hazard rate 

*« ^ T^M 

Laplace transform 

f -<*>='-'C+„) 

Mean 

p(T) = r + ^ 

Median 

Md = 

Mode 

I a ~ 1 i 

Mo = l (i 

1 r, otherwise 

Variance 


Coefficient of variation 

c ^- a fgr 

Skewness 

« 3 (T) - -4 

Va 

Kurtosis 

CX4(T) = 3 H— 
o; 


a, j3 > 0, r > 0 
7 (a,y) = Jq x a ~ 1 e~ x dx 
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Figure 5.9: Gamma density functions (r = 0) for flow rate 600 veh/h 
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Figure 5.10: Gamma pdf’s and sample edf’s 
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Figure 5.11: Gamma hazard functions (r = 0) for flow rate 600 veh/h 
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Figure 5.12: Gamma hazard functions (r = 0, a = 3) 
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There is an explicit solution: 


_ - 2 s 

T = t - 

(5.37) 

OL 3 

P=~ 

(5.38) 

S«3 

4 

a 2 ‘ 

(5.39) 

«3 



The estimate of the shape parameter (a) is directly proportional to the 
squared sample skewness ( 0 : 3 ) and independent of the location parameter. 
This may result in a serious fault (Bowman & Shenton 1988). The efficiency 
of the shape parameter estimate can be as low as 22 % (Stuart & Ord 
1991). Furthermore, the location parameter estimate can be greater than the 
smallest sample observation t^, and hence inadmissible (Cohen & Whitten 
1988). 

The method of moments is inefficient in parameter estimation, except for 
distributions closely resembling the normal (Greenwood & Durand 1960). 
Headway distributions are positively skew and do not resemble the normal 
distribution. Preliminary studies by the author have demonstrated the poor 
performance of the MMEs for the gamma distribution. 

A MME can be used to make a crude estimate of the range of a. Figure 
4.14 (page 73) shows that the sample skewness is mostly greater than two 
and in all cases greater than 1.25. This means that the shape parameter 
values are expected to be mostly below unity, and not much greater than 
three. This information is useful in the evaluation of other methods. 

Cohen & Whitten (1988) have suggested modified method of moments 
estimators (MMMEs), because they are easy to calculate and applicable 
over the entire parameter space. The estimating equations are: 


-- _ a 

t = T + — 

(3 


s 


2 



f, $, a 


a 

J 2 

1 

n + 1 


(5.40) 

(5.41) 

(5.42) 
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Maximum likelihood estimators 

The likelihood and the log-likelihood functions of the gamma distribution 


are: 


L(t, (3, a) = JJ /(tj | T,P,a) 

3 = 1 

n n 




T(a) 


(5.43) 


n^' 

3 =1 
n 


- r ) a ~ 1 e~ l3( ' tj ~ T ' ) 


In L{t, f3, a) = n[a In (3 — InT(a)] + ^(a - 1) ln(tj - r) - (3(tj - r). 

3 = 1 

(5.44) 

The equations for the MLEs are obtained by equating to zero the partial 
derivatives and dividing by n: 


1 dhiL(t,j3,a) _ (3 | 1 1 _ Q 

1 — d n tj 


n(l — d) df 

1 din L(t, (3, d) d 1 


3=1 3 


- T 


n 


d(3 


(3 


n 


- f ) = 0 


(5.45) 

(5.46) 


3 =1 


1 dlnL(f, (3, a) 


n da 
where T(-) is the digamma function: 

’k(x) = 


ln/3 — ’l'(d) 4—In (tj — f) = 0, (5-47) 


i=i 


d[lnT(x)] 

dx 


(5.47a) 


The equations can be expressed in terms of modified arithmetic (A), 
geometric (G) and harmonic ( H ) means: 


A = - 


3=1 


G = 


N 


3=1 


- T 


n 

H = -T 

n 


n ' tj — f 

j=l J 


(5.48) 

(5.49) 

(5.50) 
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as follows (Bowman & Shenton 1988): 


/ = H 

a — 1 

(5.51) 

^ = A 

P 

(5.52) 

'L(d) — In (i = In G. 

(5.53) 


If the shape parameter estimate a is less than unity, the likelihood func¬ 
tion (L) becomes infinite as the location parameter approaches the first order 
statistic of the sample: 

lim L(t , (3, a) = oo. (5.54) 


In this case f = but P and a fail to exist since (Cohen & Whitten 1982): 


linr 


dL(f, (3, a) 
da 


—oo. 


(5.55) 


If the shape parameter estimate d is less than unity, the left hand side of 
equation (5.51) becomes negative, indicating that some observations should 
be less than f, which is infeasible. As a conclusion, the maximum likelihood 
method cannot be applied for shape parameter values less than unity, which 
is mostly the case in the headway data. 

Johnson et al. (1994) observe that the MLEs are rather unstable if a is 
near to unity, even though it exceeds it. They recommend that the MLEs 
should be used only if it is expected that a is at least 2.5 ( 0:3 < 1.27). 


Modified maximum likelihood estimators 


As explained above, shape parameter estimates less than unity are to be 
expected in the headway data. The MLEs are then infeasible, but the max¬ 
imum likelihood method can be modified (algorithm 5.1) by replacing the 
equation (5.51) by: 

F( f (1) \f,(3,a) = —j—• (5.56) 

Solving for the scale parameter estimator in equations (5.52) and (5.53), the 
following equations are obtained: 


ln(d) — 'L(b) = In 



P = 


a 

A' 


(5.57) 


(5.58) 
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Step 1 Give two initial location parameter estimates. Set fo to some large 
value. Assign f\ = (1 — e)im, where e is the stopping criterion (see 
step 2). Assign j = 1. Go to step 2. 

Step 2 Determine shape and scale parameter estimates. The shape param¬ 
eter estimate (d) is computed using algorithm (5.59). The scale pa¬ 
rameter estimate (fij) is given by equation (5.58). If | fj — fj-\\/fj < e, 
go to step 4, else go to step 3. 

Step 3 Determine new location parameter estimate (fj+i) from equation 
(5.61). If fj .|-i > t(i), go to step 4, else increment j by 1 and go to 
step 2. 

Step 4 Output parameter estimates (f,/3, d). Stop. 


Algorithm 5.1: Calculation of the MMLEs for the gamma distribution (Luttinen 
1991) 


The shape parameter estimator d is solved using a rapidly converging 
algorithm (Bowman & Shenton 1988): 


aj = dj-i[ln(d.,-i) - ^(dj-i)]/^ j = 1,2, 


(5.59) 


where 




0 <« 


(5.59a) 


and 


Oi o — 


«o — 


0.5000876 + 0.1648852£ - 0.0544276^ 2 

? 

8.898919 + 9.059950^ + 0.9775373£ 2 
^(17.79728 + 11.968477^ + £ 2 ) ’ 


, 0 < ^ < 0.5772 (5.59b) 


0.5772 < 


(5.59c) 


If a > 1, then 0 < ln(d) — ^(d) < 0.5772 and £ > 0.5772 leads to no 
solution. 

The location parameter estimator can be solved by finding the f, which 
gives probability l/(n + 1) for a headway less than or equal to the shortest 
sample headway: 

—\ =t( i), (5.60) 

n +1 / w 


ri— 1 
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where F 1 (-) is the inverse of the gamma PDF. Once an old estimator fj is 
known, the new estimator (fj+i) can be obtained as follows: 


Tj+l 


fj + f ( i) - F 1 


n + 1 


fj,P,a 


(5.61) 


Accuracy of parameter estimates 


The accuracy of the parameter estimates was evaluated by a C-language 
computer program calling IMSL Fortran subroutines (IMSL 1989). For each 
combination of parameters, 1,000 replicas of 365 observations were gener¬ 
ated. The accuracy demand for location parameter estimate was e = 10~ 6 . 
The subroutine RNGAM was used to generate the observations, and the 
parameters were estimated using the modified maximum likelihood method 
(algorithm 5.1). The accuracy of the estimates was measured by a loss 
function called relative root mean square error ( rrmse ) (Bai, Jakeman & 
Taylor 1990): 


rrmse(9) 


- V 


m '9j-e" 2 


m■ 


3 =1 


(5.62) 


where m is the number of replications, 9 is the true value of the parameter, 
and 9j is the parameter estimate for replica j. 

Table 5.4 presents the rrmse 's for different values of location (r) and 
shape (a) parameters. The scale parameter (/?) is set to unity to make the 
table comparable with the data presented by Bai et al. (1990). 

Some accuracy is lost as compared to the maximum likelihood method, 
especially when the shape parameter (a) is large. The restriction of a < 1 
is, however, avoided. The accuracy is quite good for a < 1. The loss of 
accuracy in the location parameter estimate (f) with large values of the 
shape parameter is unavoidable. Consequently, the accuracy of the MMLEs 
is considered appropriate for the shape parameter values near or below unity. 

Bowman & Shenton (1988) consider the MMLEs particularly successful 
for 1/4 < a < 1, and quite acceptable for 1 < a < 2. (1/4 was the 

lowest shape parameter value that they tested.) In the goodness-of-fit tests 
below, the parameters were estimated by the modified maximum likelihood 
method. This method uses all the information in the samples, and it is not 
as sensitive to outliers as the moment methods (Bowman & Shenton 1988). 

Figures 5.13, 5.14 and 5.15 show the parameter estimates for the gamma 
distribution. The location parameter estimates are mostly between 0.2 and 
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Table 5.4: Relative root mean square error (rrmse) of MMLEs for the gamma 
distribution. The rrmse' s of MLEs in (Bai et al. 1990) are shown in 
parentheses. 


a 


e 

r = 0.2 

r = 0.6 

r = 

1.0 



a 

0.063 

0.061 


0.062 

a 

= 0.50 

(3 

0.098 

0.094 


0.094 



T 

0.000 
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= 6.00 

(3 

0.124 

0.144 

(0.16) 

0.170 



T 

2.011 

0.944 

(0.65) 

0.751 


0.8. The scale parameter rises exponentially as the volume increases. The 
shape parameter estimates are concentrated between 0.4 and 3.1. On high 
speed roads the shape parameter is larger at high volumes. On low speed 
roads the correlation is not significant. 

Goodness-of-fit tests 

For each sample the parameters were estimated using the modified maxi¬ 
mum likelihood method. The goodness of fit was tested using Monte Carlo 
tests with the A-D statistic (section 2.2.4). For each sample 9,999 replica¬ 
tions were generated and analyzed. Figure 5.16 shows the test results. For 
comparison, the results of nonparametric K-S tests are also presented. 

As figure 5.16 shows, the goodness of fit is poor. The points are not 
evenly distributed between 0 and 1 at any flow level. The combined signifi¬ 
cance of the A-D tests is 4.1 • 10 -141 . For volumes less than 160 veh/h the 
K-S tests give combined significance probability of 0.25. The corresponding 
probability of the A-D tests is 0.001. 

In figure 5.17, the nonparametric K-S tests are compared with the para- 
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Figure 5.13: Location parameter estimates (MMLE) for the gamma distribution 


Scale parameter 



Figure 5.14: Scale parameter estimates (MMLE) for the gamma distribution 
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Shape parameter 



Figure 5.15: Shape parameter estimates (MMLE) for the gamma distribution 


P(K-S) P(A-D) 



Figure 5.16: The nonparametric K-S and the parametric A-D goodness-of-fit tests 
for the gamma distribution 
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Figure 5.17: Comparison of the nonparametric K-S and the parametric A-D tests 
for the gamma distribution 


metric A-D tests. The figure confirms the statement that the nonparametric 
K-S test gives too conservative results (section 2.2.4). The 5 % significance 
level of the parametric A-D test is about the same as the 20 % significance 
level of the nonparametric K-S test. A similar result was obtained by Lil- 
liefors (1969) for the negative exponential distribution. Since the shape 
parameter estimates are concentrated near unity (mean 1.09, standard de¬ 
viation 0.62), the similarity of the results is to be expected. 

Gamma distribution in headway studies 

Reasonability The gamma distribution with a > 1 has a bell-like shape, 
which gives low probability to extremely short headways. The coefficient 
of variation, however, decreases below unity. Consequently, the the shape 
parameter a is estimated below unity in most samples. In these cases, the 
problem with extremely short headways is even more critical than with the 
negative exponential distribution. The location parameter can be used to 
ease the problem, but then the original motivation for the application of the 
gamma distribution is lost. 

Figure 5.18 shows the skewness of the headway data and estimated 
gamma distributions. Although figure 4.15 indicated that the relation be¬ 
tween the skewness and the kurtosis of the gamma distribution does not 
differ much from the headway data, figure 5.18 shows that a gamma dis¬ 
tribution cannot model very skew («3 > 3) headway distributions. In fact, 





116 


CHAPTER 5. THEORETICAL HEADWAY MODELS 


Gamma distribution 



Sample 


Figure 5.18: Skewness of the headway data and fitted gamma distributions 


the skewness of the fitted gamma distributions does not correlate positively 
with the skewness of the empirical headway distributions. The MMEs would 
give different results, but the fact remains that the properties of the gamma 
distribution are very different from the properties of the headway distribu¬ 
tions. 

As a = 1 the gamma distribution is equal to the negative exponential 
distribution (r = 0) or the shifted exponential distribution (r > 0). The dis¬ 
cussion on these distributions is relevant here also. The gamma distribution 
can therefore be justified as a model for low traffic fiow conditions. 

When the fiow rate is high, most vehicles are followers, and the headways 
are concentrated near the mode. The bell-like shape of the corresponding 
headway distribution is typical of the gamma distribution with large a. How¬ 
ever, as the the flow rate increases, the headway distribution becomes more 
skew (fig. 4.14, page 73), whereas the skewness of the gamma distribution 
decreases for large a. 

Applicability The gamma distribution is rather simple in mathematical 
analysis, especially when dealing with transforms, or if the analysis is re¬ 
stricted to the Erlangian distribution. Efficient subroutines can be found to 
generate gamma variates. 

Parameter estimation of the gamma distribution presents some problems, 
but in many practical situations reasonable results can be obtained by letting 
f = — e (where e is some small value), and estimating a and (3 by the 
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method of moments. 


Validity For shape parameter values greater than unity, the density 
function has the skewed bell-like shape typical of headway distributions. 
Goodness-of-fit tests do not, however, support the use of the gamma distri¬ 
bution in traffic flow studies, except perhaps at very low and very high flow 
rates. There is not, however, enough data to make accurate recommenda¬ 
tions. At low flow conditions, the exponential distribution is usually more 
simple, and both theoretically and statistically justifiable. At higher vol¬ 
umes the gamma distribution can be given preference over the exponential 
distribution. 

In the tradeoff between applicability and validity, the gamma distribu¬ 
tion offers a little more credibility and a little more hard work than the 
negative exponential distribution. The pdf of the gamma distribution is 
appealing, when the shape parameter is greater than unity, but it fails to 
model both the strong accumulation of headways near the mode and the 
skewness of headway distributions. If the gamma distribution is used as a 
model for vehicle headways, the process studied should not be very sensitive 
to the shape of the headway distribution. 


5.2.4 Lognormal distribution 
Properties of the lognormal distribution 

If a variate Y = ln(T — r) follows the normal distribution N{ji. ex 2 ), then T 
follows the lognormal distribution: 


f(t \t,h,<j 




(5.63) 

t > T 


The lognormal relation holds if the change in a headway during a small 
time interval is a random proportion of the headway at the start of the 
interval, and the mean and the variance of the headway remain constant 
over time. Greenberg (1966) has shown the resulting connection between 
the lognormal headway distribution and car-following theory. 

The most important properties of the lognormal distribution are pre¬ 
sented in table 5.5. Concerning the four-stage identification process, the 
following observations can be made: 
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Table 5.5: Properties of the lognormal distribution 


PDF 

J (•!„(«-r)- M N 

m = \ v o- ) 

[0, otherwise 

pdf 

[0, otherwise 

Hazard rate 

h(t) - /(t) 

[ ) 1 - F(t) 

Laplace transform 

— 

Mean 

H(T) = T + e^+^ 2 

Median 

Md{T) = r + e^ 

Mode 

Mo(T) =T + e^~ a2 

Variance 

a 2 {T) = e 2 ^ 2 - l) 

Coefficient of variation 

sj e S2 — 1 

C(T) = 

1 + re _At_ 5 s 

Skewness 

a 3 (T) = (u> + 2 )s/(jO — 1 

Kurtosis 

a 4 (: T) = w 4 + 2w 3 + 3w 2 -3 


fi, a > 0, t > 0 


to = e 
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1. The lognormal pdf (fig. 5.19) has a shape similar to the empirical head¬ 
way density function, especially at high flow rates. As the coefficient 
of variation decreases, the mode moves towards the mean (fig. 5.20). 
A closer examination of the pdf (fig. 5.21), however, shows that the 
decrease of the density after the peak is too slow. 

2. The lognormal hazard function (figs. 5.22 and 5.23) is zero at t = 0, 
increases to maximum, and decreases gradually, approaching zero as 
the headway becomes large (Kalbfleisch & Prentice 1980). The hazard 
function is similar to the empirical hazard function, but the fall after 
the peak is more gradual. 

3. The coefficient of variation does not present any limitations for the 
lognormal distribution. It r = 0, the c.v. is: 

C(T) = yje? - 1. (5.64) 

4. For a corresponding value of the squared skewness, the kurtosis of the 
lognormal distribution is too large. See figure 4.15 on page 73. 


The lognormal distribution differs most from the headway data, when the 
kurtosis and the squared skewness are measured. In addition, the pdf and 
the hazard function, in particular, fall too gradually after the peak. The log¬ 
normal distribution fails in three stages of the identification process. Stages 
2 and 4 are most clear in this respect. 


Parameter estimation 

Maximum likelihood estimators The likelihood function of the lognor¬ 
mal distribution is: 

exp (_ wn NMlif ) 

L(t,h,(j) = - yaV^TTj r). (5.65) 

j =i 

This function has global maximum at f = = —oo,a 2 = oo regard¬ 

less of the sample (Crow & Shimizu 1988). Consequently, the maximum 
likelihood method is not feasible. 
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Volume [veh/h] 



f(t) 


Headway [s] 


Figure 5.19: Lognormal density function [ C(T ) = 1.2] 


Coefficient of variation 



Headway [s] 

Figure 5.20: Lognormal density function for flow rate 600 veh/h 
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Figure 5.21: Lognormal pdf’s and sample edf’s 


Volume [veh/h] 
1400 



Figure 5.22: Lognormal hazard function [C(T) = 1.2] 
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Coefficient of variation 
2 


Headway [s] 


Figure 5.23: Lognormal hazard function for flow rate 600 veh/h 


Local maximum likelihood estimators Local maximum likelihood es¬ 
timating equations for the lognormal distribution are obtained by differen¬ 
tiating the loglikelihood function (Cohen & Whitten 1980): 


din L(t, p, a) 1 A\ In (tj — t) — p, 


df 


u 


£ 

3 = 1 


t-j — f 


+ £ 


-1 D 


3 = 1 


t-j — T 


= 0 


din L(t, fi, d) 1 


dji 




5Z[ln(ij -f)-p] = 0 


3 = 1 


din L(t , fi, d) n 1 A 2 

- = j “ T ) “ W = 0. 

a (j° z —' 

1=1 


dd 


(5.66) 

(5.67) 

(5.68) 


By eliminating fi and d from these equations, the equation for r is obtained: 



- n n ^ , n n 2 - 

Info - f) - ln 2 (tj - r) + - ( Info - f) J 
L i=i i=i V i=i ' J 


n^ ln^/) = 0. (5.69) 

i “ ? 7 " 

1 = 1 J 


The local maximum likelihood estimate (LMLE) for f can now be calculated 
iteratively. Only values r < fo) are accepted. 
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After f has been solved, the scale and shape 3 parameters can be esti¬ 
mated as follows (Cohen & Whitten 1980): 

1 n 

A=-5Z ln (^ _f ) ( 5 - 7 °) 

3=1 

1 n /1 n \2 

^ 2= • ( 5 - 71 ) 
1=1 V 3=1 ' 

Although the LMLEs gave the best estimates of all the methods exam¬ 
ined, the method had convergence problems—and a series of Monte Carlo 
tests with 9,999 replications of 63 samples is very sensitive to convergence 
problems. As a result, the local maximum likelihood method was rejected. 


Modified maximum likelihood estimators The local maximum like¬ 
lihood method can be modified as explained on page 34. Equation (5.66) is 
replaced by: 

= t(i). (5.72) 

This can be expressed in terms of the inverse normal PDF: 


F 


-l 


n + 1 


T ” 1 


n + 1 


ln(t ( i) - f) - ft 
a 


(5.73) 


Now the set of equations consists of equations (5.67) and (5.68), and the 
above equation. By eliminating /j and a from these equations, the estimating 
equation for t is obtained (Cohen & Whitten 1980): 


MHq ~ f ) ~-'EM* 


- T 


3=1 


- $ 


~1 


n + 1 


n 


Xj" 2(/ .' 


3=1 


~T) ~ | - 

n 


J2 


— T, 


3=1 


= 0. (5.74) 


This equation can be solved iteratively. The scale and shape parameter 
estimates follow from (5.70) and (5.71). 

The moment estimators for the lognormal distribution have a large sam¬ 
pling error, and they are not uniquely determined. The modified moment 


3 For terminology see (Crow & Shimizu 1988). 
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P(K-S) P(A-D) 



Volume [veh/h] Volume [veh/h] 

Figure 5.24: The nonparametric K-S and the parametric A-D goodness-of-fit tests 
for the lognormal distribution 


estimation of Cohen & Whitten (1980) was also applied. The estimates 
were not as accurate as the modified maximum likelihood estimates. The 
local maximum likelihood method gave the most accurate estimates, but it 
was rejected because of convergence problems. In the goodness-of-fit tests 
below the parameters were estimated by the modified maximum likelihood 
method. 


Goodness-of-fit tests 

The goodness-of-fit was tested using the parametric Anderson-Darling (A-D) 
test. For each sample 9,999 replications were generated and analyzed. Figure 
5.24 shows the test results. For comparison, the results of nonparametric 
Kolmogorov-Smirnov (K-S) tests are also presented. 

The results are better than for the exponential or the gamma distri¬ 
bution, but the combined probabilities are still practically zero (A-D test: 
1.80 • 10~ 114 , K-S test: 1.84 • 10~ 36 ). The fit is better on low speed roads 
(A-D test: 1.78 • 10~ 15 ), but the hypothesis of lognormal vehicle headways 
should be rejected. 
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Figure 5.25: Comparison of the tests for the lognormal distribution 


The comparison of the K-S and the A-D tests (fig. 5.25) shows that the 
20 % significance level of the nonparametric K-S test is about equal to the 1- 
2 % significance level of the parametric A-D test. This, again, demonstrates 
the superiority of the parametric tests over the standard K-S test. 


Lognormal distribution in headway studies 

Reasonability Daou (1964) reported a good fit to lognormal distribu¬ 
tion of spacings within platoons. He considered space headways less than 
200 ft (61 nr) with speed differences less than 5 ft/s (5.5 km/h). Two years 
later Daou (1966) presented a more detailed analysis of the data and also 
a theoretical justification for using lognormal distribution as a model of 
constrained headways. 

Greenberg (1966) observed that “there may be some ‘universal’ law of 
traffic described (or at least approximated) by log-normal distribution”. The 
lognormal relation holds if the change in the headway of a vehicle in any 
small interval of time is a random proportion of the headway at the start 
of the interval, and the mean headway of a vehicle and the variance of its 
headways remain constant over time. Greenberg also showed a connection 
between this model and the car-following models. 

According to Baras, Dorsey & Levine (1979a), multiplicative, indepen¬ 
dent, and identically distributed errors by various drivers attempting to 
follow each other combine to give a lognormal density. Consequently, the 
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lognormal distribution appears to be an attractive model for follower head¬ 
ways. 

Applicability The modified maximum likelihood method requires raw 
headway data and the application of numerical methods, but modified mo¬ 
ment estimates are relatively easy to calculate, although with some loss of 
accuracy. Lognormal random variate generation is relatively efficient with 
currently available routines. The lack of Laplace transform, however, limits 
the applicability of the lognormal distribution. 

With currently available routines to calculate the PDF of the normal 
distribution, even in pocket calculators, there is potential for a wider appli¬ 
cation of the lognormal distribution. In mathematical analysis the lognormal 
distribution is more complicated than, say, the gamma distribution. 

Validity Tolle (1971) studied the three parameter lognormal distribution 
for headway data from a corridor on Interstate 71 in Ohio. The traffic vol¬ 
umes ranged from 800 to 1,900 veh/h. The location parameter was fixed 
(r = 0.3). The K-S tests gave nonsignificant results in all samples at 5 % 
level. Six out of eleven samples produced nonsignificant values at 5 % sig¬ 
nificance level using the chi-square test. The results were best under lowest 
flow (800-1,200 veh/h) conditions. The significance probabilities were not 
presented. 

Branston (1976) studied headway data from M4 motorway in West Lon¬ 
don and from 50 sites on two-way roads around Bloomington, Indiana. The 
results of the chi-square tests 4 for the 3-paranreter lognormal distribution 
were not very encouraging. Branston also used the lognormal distribution 
as a model for constrained headways in his lognormal generalized queuing 
model (see section 5.3.7). 

As a statistical model for vehicle headways, the lognormal distribution 
has two major problems (Luttinen 1994): 

1. The goodness of fit, although better than for the exponential or the 
gamma distributions, is not acceptable. 

2. The mathematical analysis presents some problems, because the log¬ 
normal distribution does not have a Laplace transform. If it is neces- 

4 Branston used average probability as a combined measure for goodness of fit. This, 
however, gives an unduly optimistic view. As an example: The five samples from the slow 
lane on M4 gave average probability 0.105, but the combined probability is 0.0002. For 
the mixed model, the 11 samples from loop 10 (Indiana) gave the best average probability 
0.327, but the combined probability is only 0.036. 
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sary to use numerical or Monte Carlo methods, one may as well use a 
distribution which fits the headway data better. Thus, the lognormal 
distribution is neither simple enough nor realistic enough. 

Consequently, the lognormal distribution has a limited use as an overall 
headway distribution model, but it deserves proper attention in studies on 
follower headways. 


5.3 Mixed distributions 

5.3.1 Structure of the model 

The main problem of the simple distributions is their inability to describe 
both the sharp peak and the long tail of the headway distribution. Even 
at low volume conditions, there is a considerable accumulation of head¬ 
ways near the mode. Furthermore, the empirical hazard functions (figs. 4.6 
and 4.7, page 62) indicate that the tail is exponentially distributed. These 
properties of the headway distributions suggest two vehicle categories—free 
flowing and following. Because vehicles in different categories have different 
headway properties, the division should be included in the model, which 
accordingly becomes a mixture of two distributions. 

According to Dawson & Chimini (1968) a vehicle is considered free, if: 

1. The headway is of “adequate” duration. 

2. The free vehicle is able to pass so that it does not have to modify its 
time-space trajectory, as it approaches the preceding vehicle. 

3. A passing vehicle has sustained a positive speed difference after the 
passing maneuver so that the free vehicle is still able to operate as an 
independent unit. 

The other vehicles are followers. The vehicles could also be classified into 
four or five categories: 

1. Free flowing vehicles 

2. Followers 

3. Vehicles in a transition stage from free flowing to follower 

4. Vehicles starting a passing maneuver. 
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Figure 5.26: Queuing model for a mixed headway distribution 


A fifth category would include the vehicles in the state of passing, but these 
vehicles are driving on the other lane. 

The vehicles with moderate headways are a mixture of categories 1, 2, 
3 and 4. Because it is difficult to make a distinction between vehicles in 
categories 2, 3 and 4, and in order to keep the model simple, the headway 
distribution is considered as a mixture of free headways and constrained 
(follower) headways only. A follower is then any vehicle in category 2, 3 or 
4. 

The most common approach with two vehicle categories is to consider a 
mixture of two probability distributions. A vehicle ( X ) is either a follower 
(X = 1) or a free flowing vehicle (X = 2). In terms of queuing theory, this 
model may be described as a system of two servers, only one of which may 
serve at a time (fig. 5.26). There is a continuous queue of vehicles to the 
system. Each vehicle is served by server 1 with probability p and by server 
2 with probability 1 — p. Server i has service time distribution The 

headway distribution model is the distribution of departure intervals from 
the system. This system differs from an ordinary queuing system in two 
respects: 


1. The system is always busy because of a continuous queue to the system. 


2. Only one customer (vehicle) may be in the system at a time. 


The model may be analyzed by the methods of the queuing theory, and 
especially by the method of stages (Cooper 1981, Luttinen 1990). 
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5.3.2 Properties of the model 

The density functions for constrained headways [fi{t)] and free headways 
[/ 2 (t)] are the conditional densities: 

h(t) = f(t \X = 1) (5.75) 

f 2 (t) = f(t \X = 2). (5.76) 

If the proportion of constrained headways is p, the probability density func¬ 
tion (pdf) of the headway distribution is: 

f{t) = pfi{t) + (1 - p)f 2 (t). (5.77) 


The hazard function is: 


fjt) = pfljt) + (1 -p)f2(t) 

1 -Fit) 1 - [pF^t) + (1 - p)F 2 [t)Y 


The distribution has Laplace transform: 


(5.78) 


r(a)=P/i(s) + (l-p)/2W (5-79) 


where f *(s) is the transform of fi(t), if the transform exists. 

The mean is: 

p(T)=pp(T 1 ) + (l-p)p(T 2 ). (5.80) 

where /i(T*) is the expectation of headway variate T* from distribution fi{t). 
Higher noncentral moments are obtained similarly: 5 


E(T r ) = pE(T{) + (1 - p)E(Tf). (5.81) 


Variance, skewness and kurtosis can be calculated using the relations be- 


5 The moments can also be obtained by using the derivatives of the Laplace transform: 

-<rr« I 


E(T r ) = (-1) 1 " 


ds r 
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tween central and noncentral moments: 


a\T) 


MT) 


a 4 (T) 


E[T-^(T)} 2 


(5.82) 


E(T 2 ) - [h(T)} 2 
E[T~v(T)} 3 
a\T) 


(5.83) 


{ E(T 3 ) - 3 li(T)E(T 2 ) + 2[MT)] 3 } 

E[T - p{T)} 4 
a\T) 


(5.84) 


{E(T 4 ) - 4 ii(T)E(T 3 ) + 6 [r(T)} 2 E(T 2 ) - 3 [p(T)] 4 } . 


Some proposed mixed distribution models are presented below. Detailed 
analysis and goodness-of-fit tests for all these distributions are beyond the 
scope of this research. The semi-Poisson distribution is given considerable 
attention, but other mixed distributions are discussed more briefly, using 
only information that has been readily available. Because all the models are 
based on the assumption of exponentially distributed large headways, the 
exponential tail hypothesis is examined first. 


5.3.3 The exponential tail hypothesis 
Goodness-of-fit tests 

The exponential tail hypothesis states that headways greater than a thresh¬ 
old (S) follow the negative exponential distribution: 

Ho'. 1 — F n (t) ~ e~® t , t > 5, (5.85) 

where 8 is the scale parameter estimate and n is the number of headways 
greater than 5 in the sample. The test has two stages: 

1. Determination of 5 

2. Goodness-of-fit test for headways greater than 5. 

The goodness-of-fit tests were conducted using threshold values 0-14.5 in 
increments of 0.5. Parametric Monte Carlo tests with the A-D statistic were 
used. The number of replications was 9,999. The significance probabilities 
were combined using the method described in section 2.2.5 (page 43). 



5.3. Mixed distributions 


131 


Significance 
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Figure 5.27: Goodness-of-fit tests for the exponential tail (50-70 km/h) 


Figures 5.27 and 5.28 show the combined significance probabilities for 
different volume levels. Figure 5.29 shows the combined significance prob¬ 
abilities for different speed limits. The threshold value for the exponential 
tail hypothesis is eight seconds, after which the test results are nonsignif¬ 
icant. Because of such a large threshold value, a positively skew headway 
distribution should be considered for the follower headways. 

Low speed roads (fig. 5.27 and 5.29) show nonsignificant values also at 
4.5 < 5 < 6 s. This phenomenon appears at all volume levels. It may indi¬ 
cate lower threshold values. The lower threshold (8 = 4.5) is, however, not 
accepted, because there are significant deviations from the exponential dis¬ 
tribution at (4.5s < 8 < 8s). The speed difference data (fig. 5.30, page 133) 
also support a higher threshold. 

Miller (1961) tested for threshold values four, five and six seconds, and 
found 8 = 6 s about ideal. His data were from a straight section on a two- 
lane road between Stockholm and Uppsala in Sweden. He noted that there 
is a slight excess of observed intervals in the 6-8 s range as some drivers 
begin to slow down, when they approach this close to the vehicle ahead. He 
decided that some criterion based upon relative velocities is necessary for 
determining the platoons, when time intervals are less than eight seconds. 

Wasielewski (1979) studied the exponential tail hypothesis of headway 
distributions from the center westbound lane of Route 1-94, a six-lane di¬ 
vided urban expressway in Detroit. He found no significant deviations from 
the exponential distribution, when the threshold was 2.5 seconds for volumes 
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Figure 5.28: Goodness-of-fit tests for the exponential tail (80-100 km/h) 
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Figure 5.29: Goodness-of-fit tests for the exponential tail 
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Standard deviation of relative speeds 



Headway [s] 

Figure 5.30: Standard deviation of speed differences 


less than 1,450 veh/h and 3.5 seconds for volumes greater than 1,450 veh/h. 
These values are too low for two-lane roads. 

It is possible that the goodness-of-fit tests give too small threshold es¬ 
timates. The tests cannot detect small deviations from the exponential 
distribution for headways slightly greater than 5, if the tail is exponential. 
Consequently, an alternative approach was also examined. 

Speed differences 

Properties of speed differences Car-following models (section 2.1) sug¬ 
gest that the speed difference (relative speed) between the leader and the 
follower (vi-i — Vi) is a decisive factor in the car-following process. A follower 
adjusts the speed of the vehicle to the speed of the leading vehicle. This 
speed adjustment decreases the variation of speeds among trailing vehicles. 
At some time distance the interaction of speeds vanishes. 

Figure 5.30 shows the standard deviation (s v ) of speed differences of 
consecutive vehicles against the headway. Headways are combined in one 
second intervals (t— l,i]. At short headways (1-5 s) s v is small and increases 
as the headway increases. At large headways (t > 10 s) s v is rather constant. 
On high speed roads s v is larger than on low speed roads. 

A piecewise linear model was fitted to the data—rising slope for head¬ 
ways less than the threshold and a constant value for headways greater than 
the threshold. The s v ’s were weighted by the number of observations in the 
interval. On both low and high speed roads the threshold value of about 9 s 
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was obtained. However, more extensive data sets and more accurate mea¬ 
suring equipment are needed for further analysis, because speed differences 
have larger measurement errors than absolute speeds. 

This result is in agreement with the goodness-of-fit tests above. The 
original Bureau of Public Roads (1950) reports a similar analysis and virtu¬ 
ally the same result. Chishaki & Tarnura (1984) present a very similar figure 
of the variance of the speed differences. Branston (1979) reports threshold 
values 4.5 s and 3.75 s for the nearside and offside lanes on M4 motorway 
in West London. 

Enberg & Pursula (1992) have shown that on high-class two-lane rural 
roads, the relation between the headways and the standard deviation of 
speed differences is similar to figure 5.30. For headways larger than about 
five seconds, there is no significant increase in the standard deviation. When 
the headway is less than one second, the standard deviation jumps to about 
10 km/h. This phenomenon is also reported in Bureau of Public Roads 
(1950) and by Branston (1979). A similar, but not as strong, phenomenon 
can be seen on high speed roads in figure 5.30. It may be assumed, that at 
very short headways some vehicles are starting to pass the vehicle ahead. 
The increase in s v is very small or nonexistent in figure 5.30, because there 
were only few passing vehicles. 

Figure 5.30 shows that vehicles may be divided into three categories. 
Leading vehicles are not affected by vehicles ahead. The drivers select their 
own desired speed and headways can be assumed to follow the exponen¬ 
tial distribution. Trailing vehicles follow the vehicle ahead and adjust their 
speeds accordingly. Because the mode of the headway distribution is about 
1.5 seconds, this headway can be assumed to be the average desired head¬ 
way of the followers. In addition to these two classes there are vehicles in 
a transition stage. These vehicles are neither following the vehicle ahead 
nor driving freely. The transition stage seems to begin, when the headway 
is about 8-9 seconds. Speed adjustment decreases the variation between 
speeds of consecutive vehicles. At headways less than one second, the speed 
variation increases again on high speed roads, because the vehicles starting 
to pass have greater speed than the vehicle ahead. 

Figures 5.31 and 5.32 show some basic statistics of speed difference distri¬ 
butions. Mean is near zero at all headways. As the headway is between two 
and eight seconds, the mean is slightly positive, which is probably caused 
by passers that have returned to the right lane. For headways greater than 
eight seconds the variation in the mean increases, and the mean oscillates 
around zero. When the headways are greater than ten seconds, the mean 
speed difference is negative. In this case some proportion of the vehicles 
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Figure 5.31: Mean, skewness and kurtosis of speed difference distributions (50- 
70 km/h) 


ahead are followers having constrained speeds while the speeds of 

the vehicles behind (vi) are nonconstrained. 

The skewness indicates that the speed difference distribution is practi¬ 
cally symmetric. On high speed roads at headways less than one second, the 
skewness has a rather large negative value. Considering the slightly negative 
mean and the increased standard deviation, the negative skewness is most 
likely caused by vehicles beginning to pass, thus producing negative speed 
differences. 

Summala & Vierimaa (1980) found that the speed difference distribution 
becomes skew, as the spacing grows over 50 m. At spacings longer than 
200 m the distribution is again symmetric. This phenomenon is not found 
in figures 5.31 and 5.32. 

The most significant feature in the figures is the excessively positive 
kurtosis at short headways. (The kurtosis is scaled so that the normal dis¬ 
tribution has null kurtosis.) The speed difference distributions are more 
sharply peaked than the normal distribution (see also fig. 5.33). This is 
true especially for headways less than four seconds. The kurtosis is largest 
at headways between one and two seconds, and the peak is higher on high 
speed roads. Because at headways between one and two seconds the distri¬ 
bution is symmetric with zero mean, it is evident that at these headways 
most drivers are content to follow the leader at the same speed. The kurto¬ 
sis approaches zero and the distribution approaches normality at headways 
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Figure 5.32: Mean, skewness and kurtosis of speed difference distributions (80- 
100 km/h) 


larger than seven seconds on low speed roads, and at headways larger than 
ten seconds on high speed roads. The variation in the kurtosis increases at 
large headways due to fewer observations. 

Proportion of free headways The distribution of speed differences at 
long headways can be assumed to be the distribution of speed differences for 
free flowing vehicles. The mean and standard deviation of speed differences 
at headways t > 10 s are v = —0.879 and s v = 10.492 on low speed roads, 
and v = —0.862 and s v = 11.660 on high speed roads. Assuming normality, 
the means do not differ significantly from zero, but the nonequality of the 
standard deviations is statistically significant (at level P = 0.001). On high 
speed roads the speed differences have larger variation. 

Because not every vehicle having a headway shorter than eight seconds 
is a follower, the distribution of their speed differences is a mixture of free 
speed differences and constrained speed differences. If the speed difference 
distribution //(/Ji, 02 ) of free flowing vehicles is estimated, the speed differ¬ 
ence distribution /(//, a) of all vehicles can be estimated as a mixed normal 
distribution (fig. 5.30): 

f{p,a) = + (1 ~p)f 2 ( 1 * 2 , 02 ), (5.86) 

The proportion of trailing vehicles (p) and the parameters of the speed 
difference distribution (//i,<7i) of trailing vehicles must be estimated from 
the data. 









5.3. Mixed distributions 


137 


Frequency 



Figure 5.33: Mixed normal distribution for speed differences on high speed (80- 
100 km/h) roads 


The minimum chi-square method (section 2.2.3) was used to estimate p, 
while maximum likelihood estimates were used for and <7\. The estimate 
for the proportion of free flowing vehicles (1 — p) among headways t < 8 s 
is 20 % on high speed roads and 15 % on low speed roads. The proportion 
of trailing vehicles is approximately the same as the proportion of headways 
t < 3 s on high speed roads and t < 5 s on low speed roads. 

Parameter estimation for the exponential tail 

The goodness-of-fit tests above (p. 130) indicated that headways larger than 
eight seconds belong to the exponential tail. Assuming the eight second 
threshold, the scale parameter estimate for the exponential tail is equal to 
the scale parameter estimate for the shifted exponential distribution with 
t = 8 s: 

* = 1^8 < 5 ' 87) 

where i& is the mean of headways greater than eight seconds. 

The number of free vehicles can be assumed to increase with increasing 
flow rate. Consequently, the scale parameter ( 9 ) of the exponential tail can 
be assumed to have a positive correlation with the traffic volume (A). 6 A 
linear model (9 = a + bX) was tested using the linear regression analysis. 


See also section 4.2.3, page 60. 
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Scale parameter 



Figure 5.34: Scale parameter estimates for the exponential tail 


Because the analysis of covariance (ANCOVA) indicated that there is a 
significant (P = 0.006) difference between the scale parameter estimates for 
low speed and high speed roads, separate models ( 6 * 50 , 0 so) were tested for 
both road categories. The constant term (a) was omitted, because it did not 
differ significantly from zero. (As the flow rate approaches zero, the scale 
parameter must also approach zero.) The following equations for the scale 
parameter estimates were obtained (fig. 5.34): 

0.5o = 0.813A (5.88) 

080 = 0.561A, (5.89) 

where A is the traffic volume [veh/s ]. 7 

In order to stabilize the variance, which increases with growing traffic 
volume, 1/A 2 was used as a weighting factor. The coefficient of determina¬ 
tion is R 2 = 0.936 for low speed roads and R 2 = 0.823 for high speed roads. 
The significance of the regression is P < 0.0005 in both cases. 

On high speed roads the estimates are lower. This indicates a “longer 
tail” and accordingly higher coefficient of variation, skewness and kurtosis. 
The scale parameter estimates are also the estimates for the hazard rate at 
the tail of the headway distribution. 

7 Wasielewski (1979) obtained for a center expressway lane: 


6 = -0.058 + 1.401A. 


(5.90) 
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The scale parameter was estimated using the location parameter value 
of eight seconds. The estimate is, however, valid even if the true location 
parameter has a lower value. This can be shown easily. If the true location 
parameter is a, then: 

P {T > t} = e~ e(t - a) . (5.91) 

If only headways greater than b (and a < b) are examined, then: 

e —0(t—d) 

~ g-O(b-a) (5. 

= e" 0(t " 6) . 

A larger location parameter does not cause any bias in the scale parame¬ 
ter estimate. The estimate is, however, based on fewer observations, thus 
reducing the accuracy. 



5.3.4 Hyperexponential distribution 

Properties of the hyperexponential distribution 

Hyperexponential distribution was first applied to headway studies by 
Schuhl (1955), hence it is also known as Schuhl’s (composite exponential) 
distribution. If both leader and follower headways are exponential, the head¬ 
way distribution is: 

f(t\p,0i,d 2 ) = vQie-^ + (1 - p)6 2 e- e 2 *. (5.93) 

The headways are produced by a service system with two parallel servers, 
each having exponential service times (fig. 5.35). As noted above, there is a 
constant queue to the system. 

The pdf of the hyperexponential distribution (fig. 5.36) starts from 
/(0) = p0\ + (1 — p)9 2 and decreases asymptotically to zero. The hazard 
function decreases from h( 0) = p9\ + (1 — p)0 2 to liny^oo h(t) = min {9 1 , 9 2 } 
(Trivedi 1982). The coefficient of variation is C(T) > 1 (Trivedi 1982). 

Schuhl (1955) used a location parameter (r) for the follower headways. 
The resulting pdf was: 

f{t\p, t, 0i, 0 2 ) = p0ie -0l(t ~ r) + (1 - p)9 2 e~ d2t . (5.94) 

Free headways were not given a location parameter, because these vehicles 
were considered free to pass. 
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Figure 5.35: Queuing model for the hyperexponential distribution 


Volume [veh/h] 



Headway [s] 

Figure 5.36: Hyperexponential density functions with parameters by Grecco & 
Sword (1968) 
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Hyperexponential distribution in headway studies 

Reasonability As compared with the negative exponential and the shifted 
exponential distributions, the hyperexponential distribution has the advan¬ 
tage of being able to produce a larger coefficient of variation. It also has 
separate models for free and follower headways. The negative exponential 
distribution is, however, not an appropriate model for constrained headways. 
The problem of too short headways can be avoided by location parameters, 
but this introduces an artificial discontinuity in the density function. 


Applicability Pseudo-random number generation is fast, although not as 
efficient as for the negative exponential distribution. The Laplace transform 
has a simple form: 


f*(s | p, 61 , 62 ) =p 


di 

s + 9\ 


+ (1 ~p) 


O2 

S + 62 ' 


(5.95) 


Without location parameters the hyperexponential distribution has three 
parameters to estimate. Introducing one or two location parameters in¬ 
creases the total number of parameters to four or five, which makes the 
parameter estimation more difficult. 

Grecco & Sword (1968) assumed a fixed value of r = 1 s for the location 
parameter, and estimated the three remaining parameters by minimizing 
the chi-square statistic. The regression equations for the parameters were: 


p = 

0.115A 

(5.96) 

100 

T = 

1 

(5.97) 

0 i = 

0.4 

(5.98) 

02 = 

1 

(5.99) 



24 — 0.0122A 


where A is the traffic volume [veh/h]. These estimates are, however, valid 
only for A < 870 veh/h, since p > 1 for A > 870 veh/h. Figure 5.36 shows 
the pdf calculated with these parameter values. 

Griffiths & Hunt (1991) applied the location parameter (r) to both fi(t) 
and f- 2 {t), and called the model double displaced negative exponential distri¬ 
bution (DDNED). The parameters were estimated by a “hybrid method”, 
using the equations for the mean and the variance as constraints, and min¬ 
imizing the chi-square statistic. The estimation was constrained so that p 
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remained in the range 0 < p < 0.5, and f was required to be greater [sic] 8 
than the smallest observed headway (t^). 


Validity Grecco & Sword (1968) collected headway data on a two-lane 
portion of the U.S. 52 bypass. The headway data were grouped in one 
second intervals up to twenty seconds. The data were collected in volume 
groups using the class length of 100 veh/h. The chi-square tests with 8-17 
degrees of freedom gave acceptable fit for volumes less than 700 veh/h. The 
combined probability (calculated by the present author) of the six tests is 
0.79. 

Griffiths & Hunt (1991) examined the vehicle headways in urban areas. 
The model had the same location parameter r for both fi(t) and / 2 (f). 
The mean of r was 1.5 s, the minimum 0.58 s, and the maximum 2.66 s. 
The chi-square test with 5 % significance level gave acceptable fit for 78 of 
86 data sets. Because r was greater than the shortest observed headway in 
each sample, the model would have failed all A-D tests with null significance 
level. 

Sainio (1984) tested the hyperexponential distribution 9 against headway 
data from high class Finnish highways. He presented only the chi-square 
test statistic values for the five samples. When the significance probabilities 
are calculated (u = 10), the model is rejected in three samples at 0.01 level. 
The combined significance probability is 3.9 • 10~ 6 . 

The hyperexponential distribution does not pass the identification pro¬ 
cess. Although the coefficient of variation can be greater than unity, the 
shape of the pdf and the hazard function do not have the correct shape. 
The hyperexponential distribution has similar properties as the gamma dis¬ 
tribution with shape parameter less than unity. The pdf has a similar shape, 
and the c.v. is larger than unity. The hyperexponential distribution, how¬ 
ever, has a more reasonable hazard rate at t = t, and the kurtosis is slightly 
closer to the empirical kurtosis, when plotted against the squared skewness. 
Consequently, the identification process suggests that the hyperexponential 
distribution could perform slightly better than the gamma distribution as a 
headway model. 


8 If the location parameter (r) is greater than the smallest headway (tm), the proba¬ 
bility P{T < t(i)} is null. Consequently, the headway sample cannot be generated by the 
proposed model. 

9 Sainio (1984) called it the Kell distribution (see Kell 1962). 
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Figure 5.37: Queuing model for the hyperlang distribution 


5.3.5 Hyperlang distribution 
Properties of the hyperlang distribution 

Dawson (in Drew 1967, Dawson & Chimini 1968, Dawson 1969) has sug¬ 
gested the Erlang-distribution as a model for constrained headways. The 
headways in the hyperlang (hyper-Erlang) model are produced by a service 
system with two alternative channels (fig. 5.37). Channel 1 is selected with 
probability p, and it has one server with Erlang-distributed service times, 
or alternatively r servers in series each having the same exponential service 
time distribution. The service time distribution in channel 2 is exponential. 
A location (threshold) parameter is shown as a server (D) with a constant 
service time. 

The hyperexponential distribution is naturally a special case of the hyp¬ 
erlang distribution (r = 1). The generalization to exponential-gamma mix¬ 
ture distribution is straightforward. 

The pdf of the hyperlang distribution is: 10 

f(t\p,T l ,9 1 ,T 2 ,02,r) = p 9 -^j —— e -0i(t-n) + (i _ p ) 6 i 2e - 02 (t-r 2 )_ 

(r- 1)! 

(5.100) 


°In place of 8i Dawson (1969) used parameter 7 i, so that: 


9i = 


7i - n 


where 7 ; is the average headway in channel i. 
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The distribution has Laplace transform (Luttinen 1990): 
f*(s\p,Ti,e 1 ,T 2 ,e 2 ,r) = p( ) e~ ST1 + (1 -p)—f— e~ ST2 . (5.101) 

\S + 01 J S + 0-2 

The mean and the variance are (Luttinen 1990): 

H(T) =p ^ Wn') + (1 -p) ^ + 7^ (5.102) 

v 2 ( t )=p (^; + Tl ) + +( 1 - p) (^ + T2 ) + ~ q ~ 2 ( 5 - 103 ) 

■K^ +t, ) +(i_p, (s + t2 )] ■ 

The hazard function of the hyperlang distribution is similar to the empir¬ 
ical hazard function, except that at zero headway h( 0 | p,9i,02,r) = # 2 > 
assuming t\ = T 2 = 0 (Trivedi 1982). 

Hyperlang distribution in headway studies 

Reasonability The hyperlang distribution has exponential tail, and the 
shape of the pdf is similar to the empirical headway distributions. Some 
additional flexibility could be obtained using gamma distribution for con¬ 
strained headways. 

The distribution avoids extremely short headways, but the location pa¬ 
rameter ( 72 ) of the free headway distribution makes the pdf and the hazard 
function discontinuous. The model assumes that the frequency of free head¬ 
ways is highest, when the headway is only slightly greater than T 2 , and there 
are no free headways, when t is a little smaller than 72 . The problem is more 
outstanding at low flow conditions, when the proportion of free headways is 
large. 

Applicability The large number of parameters (six) causes problems in 
parameter estimation. Dawson Sz Chimini (1968) and Summala & Vierimaa 
(1980) have estimated the parameters using least squares fit. For the head¬ 
way data in HCM (1965), Dawson & Chimini (1968) estimated r = 2 and 
72 = 0.75 for all samples. The estimated density and hazard functions are 
displayed in figures 5.38 and 5.39. 

The generation of pseudorandom numbers is reasonably effective. The 
Laplace transform exists, but the mathematical analysis is more tedious 
than the analysis of the exponential and the gamma distributions. 
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Volume [veh/h] 



Headway [s] 

Figure 5.38: Hyperlang density functions [parameters by Dawson & Chimini 
(1968)] 



Figure 5.39: Hyperlang hazard functions [parameters by Dawson & Chimini 
(1968)] 
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Validity Dawson & Chimini (1968) and Dawson (1969) did not report any 
goodness-of-fit tests. Because the hyperlang distribution is a generalization 
of negative exponential, shifted exponential, Erlang and hyperexponential 
distributions, it is likely to fit the headway data better. But for a six pa¬ 
rameter distribution, the goodness-of-fit requirements are extremely high. 


5.3.6 M/D/1 queuing model 

Tanner (1953, 1961) has modeled the arrival process as a M/D/1 queuing 
process (fig. 5.40). The service system has one server with constant service 
time (r). There is not, however, a continuous queue to the service point, 
but the arrival process is Poisson. 

The vehicles in platoons have constant headways, and the free headways 
follow the shifted exponential distribution. The probability for a follower 
headway (t = r) is equal to the utilization factor 11 (p) of the server: 

P {T = r} = At = p (5.104) 


For steady state conditions, the utilization factor must be less than unity. 
The platoon length follows the Borel distribution, and the probability for 
platoon length k is (Tanner 1961): 


Pk 


(Xrk) k 1 Xrk 
n\ ' 


(5.105) 


Because the platoon length distribution is not geometric, the headways are 
not i.i.d. 

11 The utilization factor (p) is the expected number of arrivals per mean service time 
in the limit (Gross & Harris 1985), or equivalently the fraction of time that the server is 
busy (Kleinrock 1975). 








5.3. Mixed distributions 


147 


Volume [veh/h] 

14 °Sooo 

600 



Headway [s] 

Figure 5.41: M/D/1 density function (r = 1 s) 


The PDF of departure intervals (headways) is discontinuous (Luttinen 
1990): 


|A,r ) = 


0, if t < t 

1 — (1 — Ar)e -A ^ -T ), otherwise. 


The pdf is obtained by differentiation (Luttinen 1990): 
/( 1 1 A, r ) = 


0, if t < t 

(1 — Ar)Ae _A ^ _T ), otherwise. 


(5.106) 


(5.107) 


The pdf is equal to the pdf of the shifted exponential distribution multiplied 
by the server idleness factor (1 — p). The mean, the variance, and the 
coefficient of variation are (Luttinen 1990): 


MT) -1 

(5.108) 

ff2 < r > = X2 - T " 

(5.109) 

C(T) = y/l - A 2 r 2 < 1. 

(5.110) 


Cowan (1975, 1978), used the headway distribution assuming a renewal 
process. The Cowan M3 model is accordingly a generalization of the M/D/1 
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model: 


F(t\6,X 


0, if t < t 

1 — (1 — 6)e~ x ^~ T \ otherwise. 


(5.111) 


The parameters of the M/D/1 headway model can be easily estimated 
by the method of moments. The scale parameter estimate (A) is equal to 
the traffic volume: 

A = | (5.112) 

and it does not depend on r. The trailing headway (r) can be estimated 
using the sample variance: 


T = 



(5.113) 


M/D/1 queuing model in headway studies 

Reasonability In some sense the M/D/1 model follows logically from the 
shifted exponential distribution: If the vehicles move freely at headways 
greater than r, and headways shorter than r are not possible, there must 
be some platooning with headways equal to r. The model separates leading 
and following vehicles, but the hypothesis of constant headways for following 
vehicles is unrealistic. 


Applicability The M/D/1 model is rather simple to analyze mathemat¬ 
ically. The parameter estimation is straightforward, and the generation of 
pseudorandom variates does not present any problems. 


Validity Sullivan & Troutbeck (1994) applied the Cowan’s M3 model to 
urban traffic flow. They concluded that the model is appropriate to eval¬ 
uate the performance of roundabouts and unsignalized intersections, where 
accuracy is only required for headways greater than the critical gap. They 
did not present any goodness-of-fit tests. 

Akgelik & Chung (1994) described the calibration of the M3 model, 
which they called the “bunched exponential distribution”. On the basis of 
visual evaluation they considered that the model provides good estimates 
of arrival headways, and they strongly recommended its use in stead of the 
negative exponential and the shifted exponential distributions. 
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Figure 5.42: M/D/1 hazard function (r = 1 s) 


The discontinuity of the pdf and even the PDF is a major problem in 


the model. The goodness-of-fit becomes worse as the proportion of trailing 


vehicles (p) increases. The unfeasibility of the model is, however, most 
obvious in the hazard function plot of figure 5.42. In addition, the coefficient 
of variation is too low for most flow conditions. 

5.3.7 Generalized queuing model 

A queuing model with Poisson arrivals and a general service time distribu¬ 
tion (fig. 5.43) is called the M/G/l queuing system (fig. 5.43). It produces 
follower headways (Ti) as long as the server is busy. With probability 1 —p, 
the server experiences some idleness during the interdeparture period. The 
length of an interdeparture time with idleness is the sum of the service time 
(Tj) and the idle time (T2) (Gross & Harris 1985). The density of this sum 
is the convolution (/1 * f 2 ) of the service time density and the interarrival 
time density, where interarrival time density (/o) is exponential. 

The pdf of the inter departure time distribution is: 



(5.114) 
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The PDF is (Gross & Harris 1985): 

F(t) = pF\{t) + (1 — p) f F\(t — u)6e~ eu du, 

Jo 

or equivalently: 

F(t) = pFi(t) + (1 - p) J fi(u) (l - e ~ e(t ~ u) ^j du 

= Fx(t) - (1 -p)e~ et f fi{u)e du du. 

Jo 


(5.115) 


(5.116) 


The Laplace transform is obtained using the convolution property 12 of the 
transform: 


/*(») =p/i(») + (i-p)/i (»)/!(») 
= pfi(s) + {l-p)ft(s)j^-g. 


(5.117) 


The proportion of interdeparture times with no idleness (p) is equal to the 
utilization factor (p) of the server (Gross & Harris 1985): 


_ KTi) 

11 h(t 2 y 


(5.118) 


Cowan (1975) used gamma distributed service times. Branston (1976) 
applied both gamma and lognormal service time distributions. The models 
were modified so that p was not equal to the utilization factor, but it was 
estimated from the data. Cowan called this model M4, while Branston called 
it the generalized queuing model. Figure 5.44 shows the pdf of the model with 
lognormal follower headways. The parameters follow the approximations 
given by Branston. Figure 5.44 shows the corresponding hazard function. 
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Figure 5.44: Density function of the lognormal generalized queuing model 


Hazard rate 
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Figure 5.45: Hazard function of the lognormal generalized queuing model 
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For comparison with the semi-Poisson distribution, some properties of 
the generalized queuing model with gamma distributed service times are 
presented. The pdf is: 13 


a+a—l 


(3 a t 


f(t\p,P,a,0) =p '" r J , e dt + (l -p)^—9e et [ u a x e u{ ' 3 9) du 

r(a) J o 


= P~ 


,-Pt 


F(«) 

(3 a t a ~ 1 

r^T 


(3 — 9 


r(a) 


(5.119) 


where y(-) is the incomplete gamma function (see table 5.3, page 104). The 
pdf may also be expressed as: 


f(t | p,0,a,9) = pf L (t | (3, a) 

+ (!-p) (^) Ti(t | (3-d,a)f 2 (t | 0). (5.120) 


Correspondingly, the PDF can be expressed as: 


F( t | p, 0, a, 9) = 7 ^.’ ^ - (1 - p) 


F(a) 

= Fi(f | /3,a) - (1 -p) 
The Laplace transform is: 

f*(s 0 = 


0 \ a l[ a A0- e )] e -et 


[3-9 


r («) 


/? 


/3-0 


Fi(f |/?-0,a)[l-F 2 (f |0)]. (5.121) 


0 


s + [3 


P + (l-p) 


s + 9 


(5.122) 


The moments as well as the measures of variation, skewness and kurtosis 


12 Convolution property of the Laplace transform: 

p* n* p* 

JX+Y — J X jY • 

See Kleinrock (1975). 

13 See relation 3.381.1 in Gradshteyn & R.yzhik (1980). 
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are now easily obtained: 


»(T) 
a\T) 
C(T ) 


{3 9 


/3 2 9 2 

i Jaff 2 + (3 2 (1 — p 2 ) 
a9 + (3(1 — p ) 


2[/? 3 (l-p 3 ) + afl 3 ] 

] [/3 2 (l-p 2 )+«0 2 ] 3 / 2 

3 {/3 4 [4 - (1 + p 2 ) 2 ] + a@ 2 [2/3 2 (l - p 2 ) + 0 2 (« + 2)]} 
“ 4l j [/3 2 (l-p 2 ) + a0 2 ] 2 


(5.123) 

(5.124) 

(5.125) 

(5.126) 

(5.127) 


Three special cases are worth mentioning: 


1. p = 1. The model is equal to the gamma distribution [p(T) = a/(3\. 

2. p = 0. The headway distribution is the sum of a gamma and an ex¬ 
ponential random variate [p(T] = a/(3 + 1/9]. As the mean follower 
headway ( a/(3 ) approaches null, the distribution of all headways ap¬ 
proaches exponentiality [fi(T) = 1/9}. 

3. p = p < 1. The departure rate is equal to the arrival rate [1 / p(T) = 9]. 
This is the M/G/l model. 


Generalized queuing model in headway studies 

Reasonability According to Cowan (1975) the generalized queuing model 
describes a situation that an oo-lane road merges into one lane according 
to the first-come-first-merge rule such that each driver leaves a randomly 
chosen tracking headway. The headways at the entry point can be described 
by the generalized queuing model. 

Wasielewski (1979) criticizes the generalized queuing model for lacking 
in a physical basis. A free headway is a sum of a constrained headway (T\) 
and an exponential variate (T 2 ). But the rationale for selecting exponential 
distribution for free headways lies in the fact that these vehicles are not 
influenced by the vehicle ahead. 

Wasielewski prefers the semi-Poisson distribution, but as will be shown, 
the free headway in the semi-Poisson model is also a sum of two random vari¬ 
ates. In fact, the two distributions are extremely similar: The generalized 
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queuing model with gamma distributed follower headways was plotted us¬ 
ing the same parameter values as for the semi-Poisson distribution in figure 
5.48. The two pdf’s were virtually indistinguishable. 

Although the generalized queuing model gives separate models for free 
and follower headways, it is not based on the principles of the traffic ffow 
dynamics. The model is, however, very flexible, or actually a family of 
models. 


Applicability One of the benefits of this model is that it enables the use 
of queuing theory tools for the M/G/l systems. The Laplace transform 
exists, if it exists for the follower headway distribution. In addition, the 
random number generation is straightforward. 

The mathematical tractability of the generalized queuing model depends 
on the service time distribution. The gamma distribution does not present 
unsurpassable problems. The lognormal distribution is theoretically inter¬ 
esting as a model for follower headways, but it leads to more complicated 
formulas. Moreover, it does not have the Laplace transform. 

Parameter estimation for the generalized queuing model is problematic. 
Branston (1976) estimated the parameters by minimizing the chi-square 
statistic. Cowan (1975) tried the maximum likelihood method, but could not 
solve the resulting set of four likelihood equations. The method of moments 
yielded easier equations, but the estimates were infeasible. Finally, Cowan 
estimated the parameters by minimizing the K-S statistic. 


Validity The probability density function (fig. 5.44) and the hazard func¬ 
tion (fig. 5.45) have the correct basic shape. The model is flexible enough 
in terms of the coefficient of variation. Figure 5.46 shows the relation of 
the kurtosis and the squared skewness for the generalized queuing model 
with gamma distributed follower headways. The parameters were generated 
randomly within the limits displayed in the figure. The model can produce 
distributions such that the kurtosis is only slightly greater than the kurtosis 
of empirical headway distributions. 

Cowan (1975) fitted the generalized queuing model (he called it M4) to 
a data set of 1,324 successive headways from Mona Valley Road in Sydney. 
The headways were observed over approximately two hours on the west¬ 
bound lane of a two lane road. He used gamma distributed service time 
headways. The minimized K-S statistic was 0.015, which is not significant 
at 1 % level. 
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Kurtosis 



Squared skewness 


Figure 5.46: KS 2 -chart of the generalized queuing model with gamma distributed 
follower headways (simulated data) 


Branston (1976) compared the generalized queuing model and the semi- 
Poisson distribution. The data were from the fast lane of M4 motorway 
in England. He reported average probability of the 16 chi-square tests to 
be 0.196 for the senri-Poisson distribution with gamma distributed zone of 
emptiness. For the generalized queuing model with gamma and lognormal 
service time distributions Branston reported average probabilities of 0.193 
and 0.359, respectively. The combined probabilities are, however, 0.0004, 
0.0002, and 0.0985, respectively. For the slow lane, the generalized queuing 
model did not give substantially better fit. 

5.3.8 Semi-Poisson distribution 
Properties of the semi-Poisson distribution 

The semi-Poisson model of Buckley(1962, 1968) is based on the conjecture 
that in front of each vehicle there is a zone of emptiness, which describes 
the fluctuations in the car following (Wasielewski 1979), and has probability 
density fi(t). The exponential free headway distribution is modified so that 
it includes only headways greater than a random variate (Z) sampled from 
/i(t). Figure 5.47 shows the senri-Poisson model as a queuing system, where 
M’ stands for a modified exponential service time distribution. 

The free headway PDF is: 


F 2 (t) = P{ Y < 1 1 Y > Z } , 


(5.128) 
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Figure 5.47: Queuing model for the semi-Poisson distribution 


where Y is a random variate from the negative exponential distribution. 
Using the definition of the conditional probability, the PDF can be expressed 
as: 


_ P{Y < t,Z < Y} 


P{Z < Y} 
Frz(t,Y) 

Fi(Y) ’ 


(5.129) 


where F\ (Y) is the probability that an exponential random variate (Y) 
is greater than a random variate from the follower headway distribution. 
Fyz(t,Y) is the probability that t is greater than or equal to an exponential 
random variate Y. which, in turn, is greater than a random variate Z from 
the follower distribution. 

For a fixed value Y = y. the probability P {Z < Y} is the product 
¥{Y = y}P{Z < y}. If Y is exponentially distributed with pdf <?(•), the 
probability Fyz(t,Y) = P{Y <t,Z< Y} can be expressed by integrating 
from null to t : 



(5.130) 


The pdf is obtained by differentiation: 


fyz(t,Y) = —F Y z(t,Y) 

= g(t)Fi(t). 


(5.131) 
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Consequently, the pdf for free headways is: 


/ 2 (f) 


fyz{t,Y) 

Fi(Y) 

m) 


Fi(Y) 


9(t)- 


(5.132) 


F\{Y) is the probability that an exponential random variate (Y) is 
greater than a random variate (Z) from the follower distribution. For 
a fixed value Y = y, the probability P{T > Z} is equal to the product 
¥{Y = y}F{Z <y}. If Y is exponentially distributed with scale parame¬ 
ter 9 , the probability F\ (Y) can be expressed by integrating from null to 
infinity: 

poo ry 

F\(Y) = / 9e~ ey / fi(z) dz dy. (5.133) 

Jo Jo 

Using the relation 29.2.6 of Abramowitz & Stegun (1972), this probability 
is the same as the Laplace transform of the follower distribution (Buckley 
1968): 

Fi(Y) = mO). (5.134) 

This leads to the equation for the free headway pdf: 


/a(f) = fffijS®. (5.135) 

The Laplace transform can be derived using the same property 29.2.6 of 
Abramowitz & Stegun (1972): 


f) ft 

m -mL 

0 ftis + O) 

s + e ' ff(e) ' 


(5.136) 


As Buckley (1968) has observed, the first term is the Laplace transform 
of the exponential distribution g(t) = 9e~ et . Accordingly, the Laplace trans¬ 
form of the free headway distribution is the convolution of two distributions, 
namely g(t) and, say, c(t). Because c(t) has the Laplace transform: 


c*(s) 


ms+9) 

fm 


(5.137) 
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it can be derived by the inverse transform (Buckley 1968): 

°(t) = JP§~f~ 6t - ( 5 - 138 ) 

A free headway is thus a sum of two random variates, which follow the 
distributions g(t) and c(t). 

The pdf for the semi-Poisson distribution is (Buckley 1968): 

f(t) =pf 1 (t) + (l-p) I ^-6e- e \ t>0, 9>0, 0<p< 1. (5.139) 

JiW) 

The PDF is obtained by integration: 

F(t) = pFi(t) + [ Fi(z)6e~ dz dz, t > 0, 0 > 0, 0 < p < 1. 

J l W) J o 

(5.140) 

The distribution has the Laplace transform (Buckley 1968): 

/•w=p/rw+(i-p) ( ^ ( ^ ) - (5.14D 


Follower headway distribution 

Buckley (1962) suggested the normal distribution as a model for followers’ 
headways. He found that the error due to the non-zero probability of neg¬ 
ative headways was very small. Later Buckley (1968) applied gamma and 
truncated normal distributions. He observed that gamma distribution has 
the general shape which is assumed to be correct for follower headways. On 
the other hand, he assumed that the truncated normal distribution could 
describe the error in the tracking task, when a driver attempts to maintain a 
desired spacing. With gamma distributed follower headways Buckley could 
not obtain solutions for the parameters at low and low-medium flows. When 
the parameters for both models were obtained, the model with gamma dis¬ 
tributed follower headways gave similar or better goodness of fit than the 
truncated normal distribution. 

Ashton (1971) saw no strong theoretical justification for choosing the 
normal distribution. He chose gamma distribution because of its connection 
with the exponential distribution. Such a model should be theoretically 
more tractable. In the data analysis Ashton, however, used exponential 
zone of emptiness. 
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Branston (1976) applied the model with the normal and the gamma dis¬ 
tributions for the follower headways. He estimated the parameters by com¬ 
bining the method of moments for p and 6 with the minimum chi-square 
method for f3 and a. He did not have similar difficulties as Buckley (1968). 
The goodness-of-fit was not acceptable, when the the normal distribution 
was used for the follower headways. For gamma distributed follower head¬ 
ways he reported an acceptable goodness of fit. 

Wasielewski (1974, 1979) calculated the distribution for the follower 
headways directly from the empirical headway distribution without intro¬ 
ducing a parametric form. This nonparanretric approach is not followed here, 
because in theoretical work and simulation a parametric model is preferable. 

To sum up, normal, truncated normal, exponential, and gamma dis¬ 
tributions have been suggested as models for the follower headways. The 
exponential distribution can describe the traffic flow with no vehicle inter¬ 
action, but it is unsuitable as a follower headway model. The studies of 
Buckley and Branston suggest that the gamma distribution gives better re¬ 
sults than the normal and the truncated normal distributions. Lognormal 
distribution would be theoretically an interesting alternative, but its useful¬ 
ness is limited, because it does not have a Laplace transform. Consequently, 
the gamma distribution was selected as the follower headway model. 

The pdf (fig. 5.48) of the semi-Poisson distribution with gamma dis¬ 
tributed follower headways is: 


f(t\p,P,a,6) = p ^*) . (Je m + (1 - p)^ 


F(«) 


F(a) 


1 + ^' 06 


-et 


t. > 0; p,a,6 > 0; 0 <p < 1, (5.142) 


where y(-) is the incomplete gamma function (see table 5.3, page 104). The 
PDF is: 


F(t\p,(3,a,6) = p 7 ^.’^ + (1 - p ) 


F(a) 


r («) 


Q\ a rt 
1 + <? 


f 7 {a,(3u)e 9u du, 

Jo 


t. > 0; /3, a, 6 > 0; 0 <p < 1. (5.143) 


Figure 5.49 shows the estimated semi-Poisson hazard function for sample 4. 
The Laplace transform of the semi-Poisson distribution is: 


f*(s) = p 


U 


fJ + s ) 0 + s\^/3 + 0 + s 


/3 + 6 


(5.144) 


It is obvious that a free headway is a sum of two random variates: a variate 
from the exponential distribution and a variate from the gamma distribution 
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Density 



Figure 5.48: Semi-Poisson density function for sample 4 


Hazard rate 



Headway [s] 


Figure 5.49: Semi-Poisson hazard function for sample 4 
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with scale parameter equal to the sum (/? + 6) of the scale parameters from 
the follower gamma distribution and the exponential distribution. This re¬ 
sult can be used in the generation of semi-Poisson pseudo-random numbers. 

The moments of the distribution can be obtained using the Laplace trans¬ 
form: 


E(T r ) = (-17 


The first four moments are: 


a 


E(T) = p(T)=p- + (l-p) ^- + 

n 2 \ a ( a + 1 ) ^ 


1 


E(T z )=p- 


P 2 


+ C 1 -p) \ 7B. + 


n3N_ a ( a + !)(“ + 2 ) 


E(T 6 ) = p- 


p 3 


d r f*(s ) 

dsr s =o ’ 

(5.145) 

a \ 

P + o) 

(5.146) 

2 a a(a + 1) \ 

6{(3+e) + (p+oy) 

(5.147) 


(5.148) 


E(T 4 ) 


( 6 6 a 

a(a + l)(a 2)(a 3) 

p ^- 


3a(a + 1) a(a l)(a 2) \ 

9(P + 9) 2 + IPTW ) 

(5.149) 


/ 24 24a 12a(a + l) 

+ 1 “ P) [04 + 03(0 + 6/) + 02(0 + 0)2 

| 4a(a + l)(a H- 2) a(a H- l)(a H- 2)(a H- 3) 
+ 0(/3 + 0) 3 + (/3 + 0) 4 


The moments can also be expressed recursively: 

E(T r )=p^ + (l-p)B r , r = 0,1,2,..., (5.150) 

where 

A 0 = B 0 = 1 (5.150a) 

A r = (a + r — l)vl r _i (5.150b) 

B ' = 7’- 1 + (7r«r (5 ' 150c) 

The central moments can be expressed in terms of the noncentral moments. 
The rth central moment about the mean is (Stuart & Ord 1987): 



162 


CHAPTER 5. THEORETICAL HEADWAY MODELS 


Thus, the measures of dispersion can be calculated using the following equa¬ 
tions: 


a\T) 
as (T) 


a 4 (T) 


E{[T- M (T)] 2 } 
E {[T - Mr)] 3 } 
a\T) 

E{[T-p(T)}*} 

u 4 (T) 


(5.152) 

(5.153) 

(5.154) 


Concerning the four-stage identification process, the following observa¬ 
tions can be made: 


1. The pdf of the senri-Poisson distribution (figs. 5.48 and 5.50) is very 
similar to the empirical density function of the headway data. 

2. The hazard rate (fig. 5.49) has the same properties as the empirical 
hazard function. It starts from zero, rises to the maximum, falls, and 
reaches a constant level. 


3. In terms of the coefficient of variation the senri-Poisson distribution is 
very ffexible. 

4. Figure 5.51 shows the kurtosis of the senri-Poisson distribution against 
the squared skewness. The parameters were generated randomly 
within the limits displayed in the figure. The model can produce sam¬ 
ples such that the kurtosis corresponding to the squared skewness is 
only slightly greater than the kurtosis of headway samples. When the 
distribution approaches symmetry («3 —> 0), the kurtosis approaches 
the values corresponding to the gamma distribution (see fig. 4.15 on 
page 73). This is to be expected considering that a 4 —>• 3 (see ta¬ 
ble 5.3 on page 104), and the distribution approaches normality as p 
approaches unity, and a tends to infinity (Johnson et al. 1994). The 
figure is very similar to the corresponding figure (5.46) of the general¬ 
ized queuing model. 


The identification process indicates that the properties of the senri-Poisson 
distribution are very similar with the headway data. The KS 2 -chart displays 
most clearly the slight differences between the senri-Poisson distribution and 
the headway data. 
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Figure 5.50: Semi-Poisson pdf and sample edf. In sample 62 the “GOF esti¬ 
mate” shows the estimated pdf that was used in the goodness-of-fit 
tests. The “alternative estimate” shows a better estimate, which 
was found using another starting point to find the maximum of the 
loglikelihood function. 

Kurtosis 



Figure 5.51: KS 2 -chart of the semi-Poisson distribution with gamma distributed 
follower headways (simulated data) 
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Parameter estimation 

Method of moments estimators According to Greenwood & Durand 
(1960) the method of moments is inefficient, except for distributions closely 
resembling the normal. In this case the gamma distribution for follower 
headways is expected to have a large shape parameter, which gives the 
distribution the bell-like shape typical of the normal distribution. On the 
other hand, the method of moments gives good parameter estimates for the 
negative exponential distribution also. Consequently, there are grounds for 
using the method of moments to estimate the parameters of the gamma 
distribution. 

The MME’s are obtained by equating the first four moments to the 
sample moments and solving simultaneously the parameter values. Buckley 
(1968) used this method to estimate the parameters. Because the simul¬ 
taneous equations were not explicitly solvable, he used numerical methods. 
He found no physically sensible solutions for flows less than 21 vehicles per 
minute (1,260 veh/h). The complicated form of the equations anticipate 
severe difficulties in finding feasible parameter estimates. 


Modified method of moments estimators To make the estimators 
tractable, and to avoid the use of higher moments the method of moments 
can be modified: Practically all headways in the exponential tail (fj > 6) 
belong to the free headway distribution. The PDF of the follower headways 
is then: 


F\(t) = 1 — e(t), when t > 5 , (5.155) 

where e(t) is small. Now, p and 8 can be estimated using the properties 
of the tail of the headway distribution, assuming that there are enough 
observations in the tail . 

Let us assume a sample of size n with m observations greater than 5. 
The scale parameter estimate 8 is the inverse of the modified sample mean 
of the tail (Wasielewski 1974, Wasielewski 1979): 


6 = 


1 

m 


j=n—m+l 



(5.156) 


where is the jth order statistic of the sample. 
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The probability that a headway belongs to the tail of the distribution is: 

poo 

P {T>6} = J f(t)dt 

= rj s °° h(t)dt+ I" 

= p[l - F,(<5)] + £°Fi(t)ee-"dt. 


(5.157) 


If t > 6 , the follower headway PDF is near unity (Fi(f) ~ 1), and the 
equation simplifies to 


P{r> ' s) = 7 M e ~'“ 


Q 

= (1 — p) ( 1 + - | e 


-65 


(5.158) 


The proportion of headways in the tail can be used as an estimate for 
the probability P{T > 5}. This leads to the following estimator for p: 


p = 1 — 


m 


1 + 


n 


e_ 

0 . 


J,>5 


(5.159) 


The system of equations can be completed by equating the first two moments 
to the corresponding sample moments: 


.a 


™ 1 = t = Pn + (i-p) h; + 


l 


a 


, _d(d + 1) 

m 2 = p- 


(3 \d (3 + 6 

( 2 2d 

+ (1 — p ) ( T ^-— 

/ 3 2 \6 2 6+ ((3 + 6) 


a (d + 1) \ 
C (3 + WJ 


(5.160) 

(5.161) 


Branston (1976) estimated 6 and p from these two equations. He then 
estimated (3 and a by minimizing the chi-square statistic. 

The equation for the first moment can be solved to obtain the estimator 
for the shape parameter: 


- = P(P + 0)[mid- (1 -p)] 

80 + pO) 


(5.162) 


The he scale parameter estimator is the solution to the polynomial equa¬ 
tion, which is obtained by substituting for d in the equation for the second 
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moment: 

[{m'\ - m 2 )0 2 + 1 - p 2 }j3 4 

+ {(1 — 2 p)(m'± — m2)0 2 + [1 — 2p{l — p)\m\9 + 5p(l — p)}0/3 3 
+ {[3m / 2 — (2 + p)m 2 ]0 2 + (4 p — l)m'i9 + 2 — p( 1 — p)}p9 2 (3 2 
+ [(m /2 — m' 2 p)0 2 + (4 p — VjrriiO — p( 1 — p)]i50 3 /3 

+ [mi 6 * - (1 - p)]p 2 0 4 = 0. (5.163) 

This equation has two unknown parameter estimates (p and (3 ). Because 
this is a fourth degree polynomial, its roots can be solved, assuming either 
p or (3 known. There is an analytic solution to the equation with hundreds 
of terms (obtained by the Maple computer algebra system), but it was not 
used, because there are well tested, fast and accurate numerical methods for 
finding the roots of a polynomial. 

Assuming p known, a and [3 can be solved using equations (5.162) and 
(5.163). The estimate is accepted, if the absolute error in the estimate of p 
is smaller than the criterion of precision: 

1 _ n ( 1 + |) e§5 ~P <e - ( 5 - 164 ) 

The problem with this method is that f3 is not a real number for all 
values of p. Unfortunately, this often occurs with such values of p that would 
result in smallest estimation errors. Consequently, the modified method of 
moments was rejected. 


Maximum likelihood estimators The likelihood function of the semi- 
Poisson distribution with gamma distributed follower headways is: 
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The log-likelihood function is: 


In L(p, /3,a,6) = — n In T(cc) + E ln 


3 =1 




+ (1 - p)j(a, fitj) ( 1 + ^ ) Oe 9tj 


(5.166) 


The maximum likelihood estimators are obtained by equating to zero 
the gradient of the log-likelihood function and solving the resulting set of 
equations. This must be done by numerical methods. 

A modified method was also tested, in which the exponential scale pa¬ 
rameter was estimated from the tail using equation (5.156). This method 
decreased the dimensionality of the problem, but there was no significant 
increase in speed and the results were less accurate. Accordingly, the maxi¬ 
mum likelihood method was used without any modifications. 

In some cases, the results were sensitive to the starting point (see 
fig. 5.50). 14 Because the calculations were very time consuming, only one 
starting point was tried. The parameter estimation of sample 59 failed. In 
five samples (26, 38, 45, 46, and 62) p was either unity or null. 

Figures 5.52 and 5.53 show the parameter estimates for the headway 
data, excluding the six samples listed above. On low speed roads there is 
significant correlation between p and volume (r = 0.734, P < 0.0005), and 
between 6 and volume (r = 0.920, P < 0.0005). That is, the proportion of 
follower headways increases with increasing traffic volume. Also, the propor¬ 
tion of short free headways increases. On high speed roads the correlation 
between 6 and the volume is also strong (r = 0.810, P < 0.0005). For p the 
correlation is not as strong, but it is still statistically significant (r = 0.511, 
P = 0.004). 

The parameters of the gamma distribution do not correlate with traffic 
volume, but they have a strong mutual correlation (figure 5.54). The ratio 
P/a is relatively constant, being 0.60 on low speed roads and 0.59 on high 
speed roads. The difference is not statistically significant (P = 0.62), but 
the follower flow rate is about 2,140 veh/h on both low speed and high speed 
roads. The estimate does not change with the overall traffic flow rate. The 
follower flow rate is near the capacity estimate of 2,200 person cars per hour 
per lane on multilane highways (HCM 1994). 


14 Branston (1976) also encountered instability in the parameter estimates. 
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Figure 5.52: Maximum likelihood estimates for p and 9 
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Figure 5.53: Maximum likelihood estimates for j3 and a 
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Figure 5.54: a against (3 


Goodness-of-fit tests 

The parameters were estimated using the maximum likelihood method. 
Sample 59 was excluded, because its parameter estimation failed. The good¬ 
ness of fit was tested using both the nonparametric K-S test and the para¬ 
metric A-D test. The parametric test was conducted using the Monte Carlo 
method described in section 2.2.4. 

For each of the 62 samples, 500 replications were generated and analyzed. 
The number of replications was low, because the parameter estimation was 
very time consuming. 15 If the parameter estimation of a replication failed, a 
new replication was generated. If more than 25 (5 %) failures were recorded, 
the goodness-of-fit test was discarded. Four samples (38, 56, 61, and 64) 
failed the test procedure. 

Figure 5.55 shows the goodness-of-fit test results for the senri-Poisson 
distribution with gamma distributed zone of emptiness. There is an inter¬ 
esting difference in the figures: The significance probabilities of the K-S 
tests are mostly between 0.2 and 0.8, while the A-D tests produce signif¬ 
icance probabilities which are mostly either below 0.3 or above 0.6. The 
comparison of the results (fig. 5.56) shows that the 5 % significance level 
of the parametric A-D test is about the same as the 40 % significance level 
of the nonparametric K-S test. Consequently, the semi-Poisson distribution 

15 It took three weeks to perform the goodness-of-fit tests on three i386/i486 personal 
computers. 
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Figure 5.55: Goodness-of-fit tests for the semi-Poisson distribution 


fits the data extremely well, but it appears to be caused more by the flexi¬ 
bility of the model than by its inherently correct shape. As the flexibility of 
the model increases, the bias in the nonparametric EDF tests also increases. 

The combined probability of the nonparametric K-S tests is 0.85. The 
parametric A-D tests, however, give combined probability of only 4.3-10 -11 . 
The fit is best, when the data have large sample variance. When the variance 
is larger than 100, the significance probability of the sample is greater than 
0.1. Only sample 62 is an exception, but it is probably caused by the 
unsuccessful parameter estimation. On the other hand, all the eight samples 
of at least 500 observations have significance probability less than 0.1. These 
samples are from high speed roads. Accordingly, the semi-Poisson model fits 
best small samples with large variances. 16 In such cases the “noise” in the 
data hides small bias in the model. 

Semi-Poisson distribution in headway studies 

Reasonability The modification of the free headway distribution / 2 (f) in 
the semi-Poisson model can be criticized: The condition that a free head- 


16 The data do not contain large (> 500) samples with large variances (> 100). 
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Figure 5.56: Comparison of the nonparametric K-S and the parametric A-D tests 
for the semi-Poisson distribution 

way should be greater than a random variate from the constrained head¬ 
way distribution (t) seems arbitrary, because the shape of the constrained 
headway distribution is not necessarily the way that the frequency of free 
headways decreases near the origin. The process of becoming a follower is 
not equal to adjusting the headway while being a follower. 

Branston (1976) describes the difference between the senri-Poisson model 
and the M/G/l queuing model as follows: 

In the semi-Poisson model each free headway is obtained by com¬ 
paring an exponential headway with a following headway, while 
in the queuing model each free headway is obtained by adding 
an exponential gap to a following headway. 

This description overstates the difference between these distributions. The 
nonfollower headway variate in the senri-Poisson distribution is also a sum 
of an exponential random variate and another random variate (see equation 
5.138). In fact, the M/G/l model and the senri-Poisson distribution do 
not differ significantly when realistic parameter values are used (Botma & 
Fi 1991). 

Applicability Parameter estimation is time consuming, but senri-Poisson 
pseudorandom numbers can be generated efficiently (Buckley 1962). The 
pdf is rather complicated, and the PDF must be solved numerically. Now 
that numerical integration is widely available in personal computers and 
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even in pocket calculators, there are better potentials for the application of 
the semi-Poisson distribution. The Laplace transform exists, if it exists for 
the follower headway distribution. 

Validity The chi-square tests of Buckley (1968) gave all “good” results. 
Branston (1976) reported an average probability of 0.196 for 16 chi-square 
tests (M4 fast lane). The combined probability, however, is only 0.0004. For 
normal zone of emptiness the results were even worse. 

The senri-Poisson distribution yields a better fit than any of the simple 
distributions. If the K-S tests give as good results as shown above, but the 
hypothesis is still rejected, the distribution is likely to be so flexible that an 
extremely good fit is required. So good a fit may be impossible for headway 
data, where some disturbances cannot always be avoided. 



Chapter 6 

Conclusions and 
recommendations 


The main purpose of the study was to present methods in order to improve 
the statistical analysis of vehicle headways. A more firm scientific foundation 
has been laid for further studies. Also, some new information about the 
properties of vehicle headways on two-lane two-way roads has been obtained. 

The combined probabilities method of Fisher (1938) provided a scientif¬ 
ically sound method to test a hypothesis on a multi-sample data set. The 
method was used extensively throughout the research. It was also demon¬ 
strated that in many earlier headway studies this method would have been 
of great help—either further confirming the results or guarding from false 
conclusions. The combined probabilities method should be routinely used 
in the tests for multi-sample data. 

The combined probabilities method forms the basis of the moving proba¬ 
bility method (Luttinen 1992), which appeared to be very helpful in demon¬ 
strating the variation of test results over traffic volumes. This new method 
provided results not directly available with the traditional methods. It can 
be assumed that the method will have much wider applications in statis¬ 
tics. However, the experience with the method is still limited, and further 
research is needed. 

The first step in the empirical research (fig. 1.1, page 20) is the data 
collection. A major problem in the collection of headway data is to obtain 
stationary data. It was shown that the combination of small samples with 
nearly equal means produces a biased data set. This bias can be avoided, 
if the data collection is based on trend analysis. Tests against trend should 
become a standard procedure in headway studies. 
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In model identification it is important to find a few methods that show 
the most important properties of the data. For the identification of headway 
models a four-stage identification process was proposed. It includes the 
estimation of: 

1. Probability density function 

2. Hazard function 

3. Coefficient of variation (CV-chart) 

4. Kurtosis and squared skewness (KS 2 -chart). 

The probability density function displays the shape of the distribution 
better than the probability distribution function. The hazard function is 
mostly used in life and reliability data analysis, but it also very informative 
in headway studies. The kernel method was shown to produce smooth es¬ 
timates for density and hazard functions. It should be preferred over the 
more traditional histogram method. 

The coefficient of variation is a scaled measure of the variation in head¬ 
ways. It is a useful measure of dispersion in the headway distribution. The 
correlation between the kurtosis and the squared skewness was very high 
in the headway data. A KS 2 -chart displays clearly differences between the 
headway data and statistical models. 

The four-stage identification process allows an effective statistical de¬ 
scription of the headway data. The process also shows many advantages and 
disadvantages of the theoretical statistical distributions when compared to 
the headway data. Moreover, the comparison of data from different sources 
would be easier, if the data were submitted to standard analyses. The four- 
stage identification process is suggested as a framework for future studies. 

The independence of consecutive headways was tested using the auto¬ 
correlation analysis, runs tests, and goodness-of-fit tests for the geometric 
platoon length distribution. The power of the tests was enhanced by com¬ 
bining the significance probabilities of each sample into a single significance 
measure. The variation of significance over traffic volumes was described 
and the evidence made stronger by the moving probability methods. The 
results indicate that the renewal hypothesis should not be accepted in all 
traffic situations, although the possible autocorrelation is weak. 

Concerning the parameter estimation , two properties of the headway 
distributions are worth consideration: 

1. The headway distributions are strongly skewed to the right. 
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2. The headways of trailing vehicles have a practical minimum. 

Because of the skewness of the headway distribution, the method of mo¬ 
ments does not give as good estimates as the maximum likelihood method. 
Because of the practical minimum headway, a location parameter (r) is often 
included in the headway models. In some cases no solutions can be found 
for the maximum likelihood equations of such models. The modified maxi¬ 
mum likelihood method was shown to produce good parameter estimates for 
distributions with a location parameter. 

The main problems of the goodness-of-fit, tests in headway studies are: 

1. The chi-square test is based on grouping of the data. This makes the 
test less powerful than the tests based on the empirical distribution 
function (EDF). 

2. The nonparametric goodness-of-fit tests based on the EDF can be 
applied only, when the parameters of the distribution are known. If 
the parameters of the headway distributions must be estimated from 
the data, the nonparametric tests give too conservative results. 

3. When the parameters are estimated from the data, parametric EDF 
tests should be used. The distribution of the test statistic is, however, 
not known. 

These problems were conquered using parametric EDF test and estimat¬ 
ing the distribution of the test statistic by Monte Carlo methods. The supe¬ 
riority of these tests over the nonparametric Kolmogorov-Smirnov tests was 
demonstrated. The power of the tests was further enhanced by combining 
the significance probabilities of all samples. It is suggested that the goodness 
of fit should be measured by both the nonparametric Kolmogorov-Smirnov 
test and by some parametric EDF test. The parametric Anderson-Darling 
test is recommended. The conclusions should be based on the parametric 
test, but the nonparametric test gives an overall goodness-of-fit measure 
against which different models can be compared. This helps to find out if 
the problem in the model is lack of fit, or flexibility of the model, which 
presents higher requirements for the fit. 

Three principles of evaluation were suggested to evaluate the headway 
distribution models: 

1. Reasonability. It is an advantage if the structure of the model 
is based on explicit theoretical reasoning about the characteristics of 
traffic flow. 
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2. Applicability. The model should have a simple structure to avoid 
insurpassable problems in mathematical analysis. If simulation is con¬ 
sidered, the generation of pseudo-random variates should be fast and 
reliable. Parameter estimation should not be too complicated. 

3. Validity. The model should give a good approximation of the real 
world phenomena. This can be tested first by the four-stage identifi¬ 
cation process and finally by the goodness-of-fit tests. 

Five statistical distributions were evaluated as models for the headway 
distributions on two-lane roads. The results of the goodness-of-fit tests 1 are 
presented in table 6.1. 


Table 6.1: Summary of the goodness-of-fit tests 


Distribution 

P(K-S) 

P(A-D) 

Negative exponential 

2.1 • 10” iti2 

2.9 • 

10 -15u 

Shifted exponential 

3.8 • 10~ 157 

5.3 • 

l 0 -!50 

Gamma 

8.7 • 10 -106 

4.1 • 

io- 141 

Lognormal 

1.8- 10" 36 

1.8 • 

10-H4 

Semi-Poisson 

0.85 

4.3- 

10 -11 


The exponential distribution can be considered as a model for vehicle 
headways under extremely low flow conditions, and for applications that 
are not very sensitive to the shape of the headway distribution. In some 
cases the analytical solution of the problem may only be possible with a 
very simple model. In such situations the limitations of the model should 
be clearly indicated. 

The shifted exponential distribution does not produce extremely short 
headways, which is a major problem in the negative exponential distribu¬ 
tion. The goodness of fit is, however, not significantly better. The shifted 
exponential distribution can be used in similar situations as the negative 
exponential distribution, especially when it is important to set a minimum 
headway. 

The gamma distribution can be used, if the distribution should have a 
simple form and low probability for short headways, and the results are not 
too sensitive to other properties of the distribution. Under low-to-moderate 

1 The precision of the Kolmogorov-Smirnov tests was four decimals. If a test result was 
0.0000, the combined probability was calculated using 0.000025 in stead. 
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traffic volumes the use of the gamma distribution can be given preference 
over the exponential distribution. 

The lognormal distribution falls between the simple-nonrealistic and 
complex-realistic models. The goodness-of-fit results are better than for 
the other simple distributions, but mathematical analysis presents some 
problems. There are, however, some theoretical reasons to consider the 
lognormal distribution as a model for the follower headway distribution 
(Daou 1966, Greenberg 1966). 

The semi-Poisson distribution can be recommended for demanding ap¬ 
plications with adequate computational facilities. It produced the best test 
results, but even it failed the parametric test. 

The data in this study are from Finnish two-lane two-way roads. Such 
factors as speed limit, passing sight distance, lane and shoulder width, climb¬ 
ing lanes, proportion of heavy vehicles, flow rate of opposite traffic, traffic 
signals, distance to major traffic generators, and intersection density have 
an effect on the headway distribution. Our knowledge of these relationships 
is, however, very limited. The headway distribution model should be very 
adaptive, but not too flexible. It should have all the right parameters, but 
none too many. In this respect all the work toward a unified traffic flow 
theory is most welcome. 

The proposed procedures give a scientific foundation to identify and 
estimate statistical models for vehicle headways, and to test the goodness of 
fit. It has been shown that the statistical methods in the analysis of vehicle 
headways should be thoroughly revised following the guidelines presented in 
this text. 
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APPENDIX A. DATA DESCRIPTION 


Table A.l: Headway samples from 80-100 km/h roads 


No. 

SL 

n 

Volume 

s 2 

C 

a 3 

CK4 

1 

100 

500 

912 

36.91 

1.54 

3.89 

20.31 

2 

100 

250 

967 

28.90 

1.44 

3.61 

17.61 

3 

100 

400 

1068 

18.04 

1.26 

3.64 

20.24 

4 

100 

400 

945 

30.75 

1.46 

4.49 

30.63 

5 

100 

391 

927 

26.34 

1.32 

3.48 

16.50 

6 

100 

125 

262 

725.61 

1.96 

3.71 

19.83 

7 

100 

190 

352 

247.15 

1.54 

2.94 

13.98 

8 

100 

90 

439 

332.74 

2.23 

4.81 

31.04 

9 

100 

75 

257 

805.51 

2.03 

3.37 

15.62 

10 

100 

180 

979 

23.07 

1.31 

4.13 

25.46 

11 

100 

420 

1282 

13.68 

1.32 

5.40 

44.68 

12 

100 

153 

1837 

1.33 

0.59 

2.95 

16.34 

13 

100 

500 

1410 

8.74 

1.16 

4.21 

25.59 

14 

100 

350 

1462 

4.64 

0.87 

3.79 

22.70 

15 

100 

250 

1388 

5.94 

0.94 

2.89 

13.11 

16 

100 

180 

1250 

15.01 

1.34 

3.81 

18.42 

17 

100 

600 

1056 

16.46 

1.19 

3.46 

17.79 

18 

100 

500 

1145 

13.82 

1.18 

4.22 

28.33 

19 

100 

200 

1230 

13.36 

1.25 

4.91 

30.65 

20 

100 

440 

1079 

12.84 

1.07 

2.98 

14.02 

21 

100 

200 

1118 

12.80 

1.11 

3.66 

20.70 

22 

80 

100 

207 

534.03 

1.33 

1.96 

7.40 

23 

80 

200 

1255 

8.63 

1.02 

3.66 

18.49 

24 

80 

500 

1451 

7.62 

1.11 

7.31 

77.03 

25 

80 

900 

1439 

6.13 

0.99 

5.16 

41.87 

26 

80 

114 

1459 

1.91 

0.56 

2.03 

9.18 

27 

80 

700 

1330 

3.29 

0.67 

3.34 

21.93 

28 

80 

370 

1245 

5.19 

0.79 

3.59 

22.92 

29 

80 

600 

1467 

3.71 

0.78 

8.76 

132.67 

30 

80 

226 

1000 

44.65 

1.86 

5.40 

37.66 

31 

80 

250 

746 

57.48 

1.57 

3.93 

21.83 

32 

80 

390 

795 

55.47 

1.64 

4.03 

22.02 

33 

80 

210 

361 

175.94 

1.33 

2.20 

8.58 
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Table A.2: Headway samples from 50-70 km/h roads 


No. 

SL 

n 

Volume 

s 2 

C 

a 3 

Ot 4 

34 

50 

200 

328 

160.87 

1.16 

2.05 

8.50 

35 

50 

170 

367 

156.78 

1.28 

2.49 

10.54 

36 

50 

200 

637 

48.95 

1.24 

2.84 

13.40 

37 

50 

251 

450 

80.59 

1.12 

1.62 

5.16 

38 

50 

150 

355 

120.85 

1.08 

1.83 

6.17 

39 

50 

167 

429 

78.92 

1.06 

1.84 

6.71 

40 

50 

100 

459 

70.88 

1.07 

1.79 

6.15 

41 

50 

200 

650 

57.29 

1.37 

4.12 

24.56 

42 

50 

150 

794 

25.49 

1.11 

3.20 

16.12 

43 

50 

350 

1291 

3.55 

0.68 

2.11 

8.08 

44 

50 

300 

662 

34.24 

1.08 

2.56 

10.89 

45 

50 

100 

158 

369.72 

0.85 

1.38 

5.09 

46 

50 

76 

148 

465.14 

0.89 

1.41 

4.37 

47 

60 

150 

613 

29.76 

0.93 

2.09 

8.58 

48 

60 

240 

764 

19.31 

0.93 

1.93 

7.42 

49 

60 

250 

646 

40.13 

1.14 

2.50 

10.90 

50 

60 

112 

533 

47.72 

1.02 

1.27 

3.62 

51 

60 

320 

885 

18.33 

1.05 

2.47 

9.47 

52 

60 

150 

1252 

8.12 

0.99 

3.99 

22.66 

53 

60 

260 

915 

22.34 

1.20 

3.32 

17.42 

54 

60 

220 

835 

27.51 

1.22 

2.78 

11.97 

55 

60 

100 

309 

218.68 

1.27 

1.75 

5.55 

56 

60 

140 

542 

58.50 

1.15 

1.96 

7.15 

57 

60 

140 

479 

95.84 

1.30 

2.16 

7.94 

58 

60 

290 

561 

62.95 

1.24 

2.33 

9.28 

59 

70 

110 

332 

200.75 

1.30 

3.12 

16.09 

60 

70 

100 

370 

154.69 

1.28 

2.27 

8.12 

61 

70 

120 

324 

156.80 

1.13 

2.06 

8.14 

62 

70 

150 

143 

1001.61 

1.26 

1.72 

5.90 

63 

70 

100 

238 

324.00 

1.19 

1.46 

4.41 

64 

70 

160 

298 

153.24 

1.02 

1.46 

4.81 

65 

70 

300 

475 

72.28 

1.12 

1.83 

6.05 



